IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i2p504-d477676.html
   My bibliography  Save this article

Assessment of Grouped Weighted Quantile Sum Regression for Modeling Chemical Mixtures and Cancer Risk

Author

Listed:
  • David C. Wheeler

    (Department of Biostatistics, School of Medicine, Virginia Commonwealth University, Richmond, VA 23298-0032, USA)

  • Salem Rustom

    (Department of Biostatistics, School of Medicine, Virginia Commonwealth University, Richmond, VA 23298-0032, USA)

  • Matthew Carli

    (Department of Biostatistics, School of Medicine, Virginia Commonwealth University, Richmond, VA 23298-0032, USA)

  • Todd P. Whitehead

    (Division of Epidemiology/Biostatistics, University of California, Berkeley School of Public Health, Berkeley, CA 94704-7394, USA)

  • Mary H. Ward

    (Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20850, USA)

  • Catherine Metayer

    (Division of Epidemiology/Biostatistics, University of California, Berkeley School of Public Health, Berkeley, CA 94704-7394, USA)

Abstract

Individuals are exposed to a large number of diverse environmental chemicals simultaneously and the evaluation of multiple chemical exposures is important for identifying cancer risk factors. The measurement of a large number of chemicals (the exposome) in epidemiologic studies is allowing for a more comprehensive assessment of cancer risk factors than was done in earlier studies that focused on only a few chemicals. Empirical evidence from epidemiologic studies shows that chemicals from different chemical classes have different magnitudes and directions of association with cancers. Given increasing data availability, there is a need for the development and assessment of statistical methods to model environmental cancer risk that considers a large number of diverse chemicals with different effects for different chemical classes. The method of grouped weighted quantile sum (GWQS) regression allows for multiple groups of chemicals to be considered in the model such that different magnitudes and directions of associations are possible for each group of chemicals. In this paper, we assessed the ability of GWQS regression to estimate exposure effects for multiple chemical groups and correctly identify important chemicals in each group using a simulation study. We compared the performance of GWQS regression with WQS regression, the least absolute shrinkage and selection operator (lasso), and the group lasso in estimating exposure effects and identifying important chemicals. The simulation study results demonstrate that GWQS is an effective method for modeling exposure to multiple groups of chemicals and compares favorably with other methods used in mixture analysis. As an application, we used GWQS regression in the California Childhood Leukemia Study (CCLS), a population-based case-control study of childhood leukemia in California to estimate exposure effects for many chemical classes while also adjusting for demographic factors. The CCLS analysis found evidence of a positive association between exposure to the herbicide dacthal and an increased risk of childhood leukemia.

Suggested Citation

  • David C. Wheeler & Salem Rustom & Matthew Carli & Todd P. Whitehead & Mary H. Ward & Catherine Metayer, 2021. "Assessment of Grouped Weighted Quantile Sum Regression for Modeling Chemical Mixtures and Cancer Risk," IJERPH, MDPI, vol. 18(2), pages 1-20, January.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:2:p:504-:d:477676
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/2/504/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/2/504/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Whitehead, T.P. & Metayer, C. & Ward, M.H. & Colt, J.S. & Gunier, R.B. & Deziel, N.C. & Rappaport, S.M. & Buffler, P.A., 2014. "Persistent organic pollutants in dust from older homes: Learning from lead," American Journal of Public Health, American Public Health Association, vol. 104(7), pages 1320-1326.
    2. Goodarz Danaei & Eric L Ding & Dariush Mozaffarian & Ben Taylor & Jürgen Rehm & Christopher J L Murray & Majid Ezzati, 2009. "The Preventable Causes of Death in the United States: Comparative Risk Assessment of Dietary, Lifestyle, and Metabolic Risk Factors," PLOS Medicine, Public Library of Science, vol. 6(4), pages 1-23, April.
    3. Ming Yuan & Yi Lin, 2006. "Model selection and estimation in regression with grouped variables," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(1), pages 49-67, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Krista Schroeder & Levent Dumenci & David B. Sarwer & Jennie G. Noll & Kevin A. Henry & Shakira F. Suglia & Christine M. Forke & David C. Wheeler, 2022. "The Intersection of Neighborhood Environment and Adverse Childhood Experiences: Methods for Creation of a Neighborhood ACEs Index," IJERPH, MDPI, vol. 19(13), pages 1-19, June.
    2. David C. Wheeler & Salem Rustom & Matthew Carli & Todd P. Whitehead & Mary H. Ward & Catherine Metayer, 2021. "Bayesian Group Index Regression for Modeling Chemical Mixtures and Cancer Risk," IJERPH, MDPI, vol. 18(7), pages 1-19, March.
    3. Matthew Carli & Mary H. Ward & Catherine Metayer & David C. Wheeler, 2022. "Imputation of Below Detection Limit Missing Data in Chemical Mixture Analysis with Bayesian Group Index Regression," IJERPH, MDPI, vol. 19(3), pages 1-17, January.
    4. Alexis E. Zavez & Emeir M. McSorley & Alison J. Yeates & Sally W. Thurston, 2023. "A Bayesian Partial Membership Model for Multiple Exposures with Uncertain Group Memberships," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(3), pages 377-400, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tutz, Gerhard & Pößnecker, Wolfgang & Uhlmann, Lorenz, 2015. "Variable selection in general multinomial logit models," Computational Statistics & Data Analysis, Elsevier, vol. 82(C), pages 207-222.
    2. Charles Courtemanche & James Marton & Benjamin Ukert & Aaron Yelowitz & Daniela Zapata, 2018. "Early Effects of the Affordable Care Act on Health Care Access, Risky Health Behaviors, and Self‐Assessed Health," Southern Economic Journal, John Wiley & Sons, vol. 84(3), pages 660-691, January.
    3. Jun Yan & Jian Huang, 2012. "Model Selection for Cox Models with Time-Varying Coefficients," Biometrics, The International Biometric Society, vol. 68(2), pages 419-428, June.
    4. Ye, Ya-Fen & Shao, Yuan-Hai & Deng, Nai-Yang & Li, Chun-Na & Hua, Xiang-Yu, 2017. "Robust Lp-norm least squares support vector regression with feature selection," Applied Mathematics and Computation, Elsevier, vol. 305(C), pages 32-52.
    5. Guillaume Sagnol & Edouard Pauwels, 2019. "An unexpected connection between Bayes A-optimal designs and the group lasso," Statistical Papers, Springer, vol. 60(2), pages 565-584, April.
    6. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    7. Shuichi Kawano, 2014. "Selection of tuning parameters in bridge regression models via Bayesian information criterion," Statistical Papers, Springer, vol. 55(4), pages 1207-1223, November.
    8. Peng, Heng & Lu, Ying, 2012. "Model selection in linear mixed effect models," Journal of Multivariate Analysis, Elsevier, vol. 109(C), pages 109-129.
    9. Yize Zhao & Matthias Chung & Brent A. Johnson & Carlos S. Moreno & Qi Long, 2016. "Hierarchical Feature Selection Incorporating Known and Novel Biological Information: Identifying Genomic Features Related to Prostate Cancer Recurrence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1427-1439, October.
    10. G. Aneiros & P. Vieu, 2016. "Sparse nonparametric model for regression with functional covariate," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 28(4), pages 839-859, October.
    11. Dong, C. & Li, S., 2021. "Specification Lasso and an Application in Financial Markets," Cambridge Working Papers in Economics 2139, Faculty of Economics, University of Cambridge.
    12. Lam, Clifford, 2008. "Estimation of large precision matrices through block penalization," LSE Research Online Documents on Economics 31543, London School of Economics and Political Science, LSE Library.
    13. Gregory Vaughan & Robert Aseltine & Kun Chen & Jun Yan, 2017. "Stagewise generalized estimating equations with grouped variables," Biometrics, The International Biometric Society, vol. 73(4), pages 1332-1342, December.
    14. Pradeep Ravikumar & John Lafferty & Han Liu & Larry Wasserman, 2009. "Sparse additive models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(5), pages 1009-1030, November.
    15. Yucheng Yang & Zhong Zheng & Weinan E, 2020. "Interpretable Neural Networks for Panel Data Analysis in Economics," Papers 2010.05311, arXiv.org, revised Nov 2020.
    16. Devijver, Emilie, 2017. "Joint rank and variable selection for parsimonious estimation in a high-dimensional finite mixture regression model," Journal of Multivariate Analysis, Elsevier, vol. 157(C), pages 1-13.
    17. Zhang, Tao & Zhang, Qingzhao & Wang, Qihua, 2014. "Model detection for functional polynomial regression," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 183-197.
    18. Dusanee Kesavayuth & Prompong Shangkhum & Vasileios Zikos, 2022. "Well-Being and Physical Health: A Mediation Analysis," Journal of Happiness Studies, Springer, vol. 23(6), pages 2849-2879, August.
    19. Brian L. Rostron & Cindy M. Chang & Brittny C. Davis Lynn & Chunfeng Ren & Esther Salazar & Bridget K. Ambrose, 2022. "The contribution of smoking-attributable mortality to differences in mortality and life expectancy among US African-American and white adults, 2000–2019," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 46(31), pages 905-918.
    20. Madeleine Cule & Richard Samworth & Michael Stewart, 2010. "Maximum likelihood estimation of a multi‐dimensional log‐concave density," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(5), pages 545-607, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:2:p:504-:d:477676. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.