IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v12y2015i9p10648-10661d54931.html
   My bibliography  Save this article

A Bayesian Approach to Account for Misclassification and Overdispersion in Count Data

Author

Listed:
  • Wenqi Wu

    (Department of Statistical Science, Baylor University, One Bear Place #97140, Waco, TX, 76706, USA)

  • James Stamey

    (Department of Statistical Science, Baylor University, One Bear Place #97140, Waco, TX, 76706, USA)

  • David Kahle

    (Department of Statistical Science, Baylor University, One Bear Place #97140, Waco, TX, 76706, USA)

Abstract

Count data are subject to considerable sources of what is often referred to as non-sampling error. Errors such as misclassification, measurement error and unmeasured confounding can lead to substantially biased estimators. It is strongly recommended that epidemiologists not only acknowledge these sorts of errors in data, but incorporate sensitivity analyses into part of the total data analysis. We extend previous work on Poisson regression models that allow for misclassification by thoroughly discussing the basis for the models and allowing for extra-Poisson variability in the form of random effects. Via simulation we show the improvements in inference that are brought about by accounting for both the misclassification and the overdispersion.

Suggested Citation

  • Wenqi Wu & James Stamey & David Kahle, 2015. "A Bayesian Approach to Account for Misclassification and Overdispersion in Count Data," IJERPH, MDPI, vol. 12(9), pages 1-14, August.
  • Handle: RePEc:gam:jijerp:v:12:y:2015:i:9:p:10648-10661:d:54931
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/12/9/10648/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/12/9/10648/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Igor Burstyn & Yunwen Yang & A. Robert Schnatter, 2014. "Effects of Non-Differential Exposure Misclassification on False Conclusions in Hypothesis-Generating Studies," IJERPH, MDPI, vol. 11(10), pages 1-16, October.
    2. Paulino, Carlos Daniel & Silva, Giovani & Alberto Achcar, Jorge, 2005. "Bayesian analysis of correlated misclassified binary data," Computational Statistics & Data Analysis, Elsevier, vol. 49(4), pages 1120-1131, June.
    3. Helmut Küchenhoff & Samuel M. Mwalili & Emmanuel Lesaffre, 2006. "A General Method for Dealing with Misclassification in Regression: The Misclassification SIMEX," Biometrics, The International Biometric Society, vol. 62(1), pages 85-96, March.
    4. Stuart Batterman & Janet Burke & Vlad Isakov & Toby Lewis & Bhramar Mukherjee & Thomas Robins, 2014. "A Comparison of Exposure Metrics for Traffic-Related Air Pollutants: Application to Epidemiology Studies in Detroit, Michigan," IJERPH, MDPI, vol. 11(9), pages 1-25, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Frénay, Benoît & Doquire, Gauthier & Verleysen, Michel, 2014. "Estimating mutual information for feature selection in the presence of label noise," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 832-848.
    2. Igor Burstyn & Francesco Barone-Adesi & Frank de Vocht & Paul Gustafson, 2019. "What to Do When Accumulated Exposure Affects Health but Only Its Duration Was Measured? A Case of Linear Regression," IJERPH, MDPI, vol. 16(11), pages 1-16, May.
    3. Michelle Snyder & Saravanan Arunachalam & Vlad Isakov & Kevin Talgo & Brian Naess & Alejandro Valencia & Mohammad Omary & Neil Davis & Rich Cook & Adel Hanna, 2014. "Creating Locally-Resolved Mobile-Source Emissions Inputs for Air Quality Modeling in Support of an Exposure Study in Detroit, Michigan, USA," IJERPH, MDPI, vol. 11(12), pages 1-28, December.
    4. Engzell, Per, 2017. "What Do Books in the Home Proxy For? A Cautionary Tale," Working Paper Series 1/2016, Stockholm University, Swedish Institute for Social Research.
    5. Sajad Shojaee & Nastaran Hajizadeh & Hadis Najafimehr & Luca Busani & Mohamad Amin Pourhoseingholi & Ahmad Reza Baghestani & Maryam Nasserinejad & Sara Ashtari & Mohammad Reza Zali, 2018. "Bayesian adjustment for trend of colorectal cancer incidence in misclassified registering across Iranian provinces," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-10, December.
    6. Muff, Stefanie & Ott, Manuela & Braun, Julia & Held, Leonhard, 2017. "Bayesian two-component measurement error modelling for survival analysis using INLA—A case study on cardiovascular disease mortality in Switzerland," Computational Statistics & Data Analysis, Elsevier, vol. 113(C), pages 177-193.
    7. Mengke Qiao & Ke-Wei Huang, 2021. "Correcting Misclassification Bias in Regression Models with Variables Generated via Data Mining," Information Systems Research, INFORMS, vol. 32(2), pages 462-480, June.
    8. Daniela Dias & Oxana Tchepel, 2018. "Spatial and Temporal Dynamics in Air Pollution Exposure Assessment," IJERPH, MDPI, vol. 15(3), pages 1-23, March.
    9. Saravanan Arunachalam & Alejandro Valencia & Yasuyuki Akita & Marc L. Serre & Mohammad Omary & Valerie Garcia & Vlad Isakov, 2014. "A Method for Estimating Urban Background Concentrations in Support of Hybrid Air Pollution Modeling for Environmental Health Studies," IJERPH, MDPI, vol. 11(10), pages 1-19, October.
    10. repec:ebl:ecbull:v:3:y:2007:i:36:p:1-19 is not listed on IDEAS
    11. Surupa Roy & Kalyan Das & Angshuman Sarkar, 2013. "Analysis of binary data with the possibility of wrong ascertainment," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 67(3), pages 293-310, August.
    12. repec:iab:iabfme:201011(en is not listed on IDEAS
    13. Simon Cheng & Yingmei Xi & Ming-Hui Chen, 2008. "A New Mixture Model for Misclassification With Applications for Survey Data," Sociological Methods & Research, , vol. 37(1), pages 75-104, August.
    14. Mochen Yang & Edward McFowland & Gordon Burtch & Gediminas Adomavicius, 2022. "Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem," INFORMS Joural on Data Science, INFORMS, vol. 1(2), pages 138-155, October.
    15. Mochen Yang & Edward McFowland III & Gordon Burtch & Gediminas Adomavicius, 2020. "Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem," Papers 2012.10790, arXiv.org.
    16. Vasilis Kazakos & Zhiwen Luo & Ian Ewart, 2020. "Quantifying the Health Burden Misclassification from the Use of Different PM 2.5 Exposure Tier Models: A Case Study of London," IJERPH, MDPI, vol. 17(3), pages 1-21, February.
    17. Mochen Yang & Gediminas Adomavicius & Gordon Burtch & Yuqing Rena, 2018. "Mind the Gap: Accounting for Measurement Error and Misclassification in Variables Generated via Data Mining," Information Systems Research, INFORMS, vol. 29(1), pages 4-24, March.
    18. Hong Li & Qifan Song & Jianxi Su, 2021. "Robust estimates of insurance misrepresentation through kernel quantile regression mixtures," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(3), pages 625-663, September.
    19. Li‐Pang Chen & Grace Y. Yi, 2021. "Analysis of noisy survival data with graphical proportional hazards measurement error models," Biometrics, The International Biometric Society, vol. 77(3), pages 956-969, September.
    20. Sheena E. Martenies & Chad W. Milando & Guy O. Williams & Stuart A. Batterman, 2017. "Disease and Health Inequalities Attributable to Air Pollutant Exposure in Detroit, Michigan," IJERPH, MDPI, vol. 14(10), pages 1-24, October.
    21. Paul S. Albert & Aiyi Liu & Tonja Nansel, 2014. "Efficient logistic regression designs under an imperfect population identifier," Biometrics, The International Biometric Society, vol. 70(1), pages 175-184, March.
    22. Gordon Burtch & Edward McFowland III & Mochen Yang & Gediminas Adomavicius, 2023. "EnsembleIV: Creating Instrumental Variables from Ensemble Learners for Robust Statistical Inference," Papers 2303.02820, arXiv.org.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:12:y:2015:i:9:p:10648-10661:d:54931. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.