IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i3p1369-d734788.html
   My bibliography  Save this article

Imputation of Below Detection Limit Missing Data in Chemical Mixture Analysis with Bayesian Group Index Regression

Author

Listed:
  • Matthew Carli

    (Department of Biostatistics, School of Medicine, Virginia Commonwealth University, One Capitol Square, 830 East Main Street, Richmond, VA 23298, USA)

  • Mary H. Ward

    (Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA)

  • Catherine Metayer

    (School of Public Health, University of California Berkeley, Berkeley, CA 94704, USA)

  • David C. Wheeler

    (Department of Biostatistics, School of Medicine, Virginia Commonwealth University, One Capitol Square, 830 East Main Street, Richmond, VA 23298, USA)

Abstract

There is growing scientific interest in identifying the multitude of chemical exposures related to human diseases through mixture analysis. In this paper, we address the issue of below detection limit (BDL) missing data in mixture analysis using Bayesian group index regression by treating both regression effects and missing BDL observations as parameters in a model estimated through a Markov chain Monte Carlo algorithm that we refer to as pseudo-Gibbs imputation. We compare this with other Bayesian imputation methods found in the literature (Multiple Imputation by Chained Equations and Sequential Full Bayes imputation) as well as with a non-Bayesian single-imputation method. To evaluate our proposed method, we conduct simulation studies with varying percentages of BDL missingness and strengths of association. We apply our method to the California Childhood Leukemia Study (CCLS) to estimate concentrations of chemicals in house dust in a mixture analysis of potential environmental risk factors for childhood leukemia. Our results indicate that pseudo-Gibbs imputation has superior power for exposure effects and sensitivity for identifying individual chemicals at high percentages of BDL missing data. In the CCLS, we found a significant positive association between concentrations of polycyclic aromatic hydrocarbons (PAHs) in homes and childhood leukemia as well as significant positive associations for polychlorinated biphenyls (PCBs) and herbicides among children from the highest quartile of household income. In conclusion, pseudo-Gibbs imputation addresses a commonly encountered problem in environmental epidemiology, providing practitioners the ability to jointly estimate the effects of multiple chemical exposures with high levels of BDL missingness.

Suggested Citation

  • Matthew Carli & Mary H. Ward & Catherine Metayer & David C. Wheeler, 2022. "Imputation of Below Detection Limit Missing Data in Chemical Mixture Analysis with Bayesian Group Index Regression," IJERPH, MDPI, vol. 19(3), pages 1-17, January.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:3:p:1369-:d:734788
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/3/1369/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/3/1369/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    2. David C. Wheeler & Salem Rustom & Matthew Carli & Todd P. Whitehead & Mary H. Ward & Catherine Metayer, 2021. "Assessment of Grouped Weighted Quantile Sum Regression for Modeling Chemical Mixtures and Cancer Risk," IJERPH, MDPI, vol. 18(2), pages 1-20, January.
    3. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    4. Gelman A., 2004. "Parameterization and Bayesian Modeling," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 537-545, January.
    5. David C. Wheeler & Salem Rustom & Matthew Carli & Todd P. Whitehead & Mary H. Ward & Catherine Metayer, 2021. "Bayesian Group Index Regression for Modeling Chemical Mixtures and Cancer Risk," IJERPH, MDPI, vol. 18(7), pages 1-19, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. David C. Wheeler & Joseph Boyle & Matt Carli & Mary H. Ward & Catherine Metayer, 2023. "Neighborhood Deprivation, Indoor Chemical Concentrations, and Spatial Risk for Childhood Leukemia," IJERPH, MDPI, vol. 20(4), pages 1-15, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Leonardo Oliveira Martins & Hirohisa Kishino, 2010. "Distribution of distances between topologies and its effect on detection of phylogenetic recombination," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 62(1), pages 145-159, February.
    2. Krista Schroeder & Levent Dumenci & David B. Sarwer & Jennie G. Noll & Kevin A. Henry & Shakira F. Suglia & Christine M. Forke & David C. Wheeler, 2022. "The Intersection of Neighborhood Environment and Adverse Childhood Experiences: Methods for Creation of a Neighborhood ACEs Index," IJERPH, MDPI, vol. 19(13), pages 1-19, June.
    3. David Kaplan & Jianshen Chen, 2012. "A Two-Step Bayesian Approach for Propensity Score Analysis: Simulations and Case Study," Psychometrika, Springer;The Psychometric Society, vol. 77(3), pages 581-609, July.
    4. Faisal Maqbool Zahid & Shahla Faisal & Christian Heumann, 2020. "Variable selection techniques after multiple imputation in high-dimensional data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 29(3), pages 553-580, September.
    5. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    6. Buddhavarapu, Prasad & Bansal, Prateek & Prozzi, Jorge A., 2021. "A new spatial count data model with time-varying parameters," Transportation Research Part B: Methodological, Elsevier, vol. 150(C), pages 566-586.
    7. Mumtaz, Haroon & Theodoridis, Konstantinos, 2017. "Common and country specific economic uncertainty," Journal of International Economics, Elsevier, vol. 105(C), pages 205-216.
    8. Jesse Elliott & Zemin Bai & Shu-Ching Hsieh & Shannon E Kelly & Li Chen & Becky Skidmore & Said Yousef & Carine Zheng & David J Stewart & George A Wells, 2020. "ALK inhibitors for non-small cell lung cancer: A systematic review and network meta-analysis," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-18, February.
    9. Christina Leuker & Thorsten Pachur & Ralph Hertwig & Timothy J. Pleskac, 2019. "Do people exploit risk–reward structures to simplify information processing in risky choice?," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 5(1), pages 76-94, August.
    10. Francois Olivier & Laval Guillaume, 2011. "Deviance Information Criteria for Model Selection in Approximate Bayesian Computation," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-25, July.
    11. Abhilash Bandam & Eedris Busari & Chloi Syranidou & Jochen Linssen & Detlef Stolten, 2022. "Classification of Building Types in Germany: A Data-Driven Modeling Approach," Data, MDPI, vol. 7(4), pages 1-23, April.
    12. Raggi, Davide & Bordignon, Silvano, 2012. "Long memory and nonlinearities in realized volatility: A Markov switching approach," Computational Statistics & Data Analysis, Elsevier, vol. 56(11), pages 3730-3742.
    13. Angelica Gianfreda & Francesco Ravazzolo & Luca Rossini, 2023. "Large Time‐Varying Volatility Models for Hourly Electricity Prices," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 85(3), pages 545-573, June.
    14. Rubio, F.J. & Steel, M.F.J., 2011. "Inference for grouped data with a truncated skew-Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3218-3231, December.
    15. Olawale Awe O. & Adedayo Adepoju A., 2018. "Modified Recursive Bayesian Algorithm For Estimating Time-Varying Parameters In Dynamic Linear Models," Statistics in Transition New Series, Polish Statistical Association, vol. 19(2), pages 258-293, June.
    16. Boonstra Philip S. & Little Roderick J.A. & West Brady T. & Andridge Rebecca R. & Alvarado-Leiton Fernanda, 2021. "A Simulation Study of Diagnostics for Selection Bias," Journal of Official Statistics, Sciendo, vol. 37(3), pages 751-769, September.
    17. Alessandri, Piergiorgio & Mumtaz, Haroon, 2019. "Financial regimes and uncertainty shocks," Journal of Monetary Economics, Elsevier, vol. 101(C), pages 31-46.
    18. Padilla, Juan L. & Azevedo, Caio L.N. & Lachos, Victor H., 2018. "Multidimensional multiple group IRT models with skew normal latent trait distributions," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 250-268.
    19. Svetlana V. Tishkovskaya & Paul G. Blackwell, 2021. "Bayesian estimation of heterogeneous environments from animal movement data," Environmetrics, John Wiley & Sons, Ltd., vol. 32(6), September.
    20. David Macro & Jeroen Weesie, 2016. "Inequalities between Others Do Matter: Evidence from Multiplayer Dictator Games," Games, MDPI, vol. 7(2), pages 1-23, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:3:p:1369-:d:734788. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.