IDEAS home Printed from https://ideas.repec.org/a/spr/jagbes/v28y2023i3d10.1007_s13253-023-00535-4.html
   My bibliography  Save this article

A Shared Latent Process Model to Correct for Preferential Sampling in Disease Surveillance Systems

Author

Listed:
  • Brian Conroy

    (Emory University)

  • Lance A. Waller

    (Emory University)

  • Ian D. Buller

    (Emory University)

  • Gregory M. Hacker

    (California Department of Public Health Vector-Borne Disease Section)

  • James R. Tucker

    (California Department of Public Health Vector-Borne Disease Section)

  • Mark G. Novak

    (California Department of Public Health Vector-Borne Disease Section)

Abstract

Disease surveillance systems are crucial to monitor and predict outbreaks, epidemics and pandemics, as well as to understand the dynamics and trends of diseases over space and time. For zoonotic diseases, i.e., diseases that spread from animals to humans, surveillance systems often rely on complex data collection mechanisms which present particular challenges to the statistician, including sampling processes that commonly violate key assumptions of standard statistical methods. One such mechanism is preferential sampling, referring to a stochastic dependency between a spatial process of interest and the locations at which it is observed, commonly arising out of practical considerations related to a limited sampling budget and a rare outcome. While this sampling strategy can lead to considerably biased spatial predictions, few solutions to address preferential sampling have been proposed in the context of disease surveillance. We propose a novel approach to correct for preferential sampling in disease surveillance applications and show by simulation the practical benefits of reduced bias in parameter estimates and greater accuracy of the estimated risk surface. We conclude with an application of the model to a disease surveillance dataset targeting plague (Yersinia pestis) in the sylvatic rodent populations in California.

Suggested Citation

  • Brian Conroy & Lance A. Waller & Ian D. Buller & Gregory M. Hacker & James R. Tucker & Mark G. Novak, 2023. "A Shared Latent Process Model to Correct for Preferential Sampling in Disease Surveillance Systems," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 28(3), pages 483-501, September.
  • Handle: RePEc:spr:jagbes:v:28:y:2023:i:3:d:10.1007_s13253-023-00535-4
    DOI: 10.1007/s13253-023-00535-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13253-023-00535-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13253-023-00535-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lucia Paci & Alan E. Gelfand & and María Asunción Beamonte & Pilar Gargallo & Manuel Salvador, 2020. "Spatial hedonic modelling adjusted for preferential sampling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 169-192, January.
    2. Kate E. Jones & Nikkita G. Patel & Marc A. Levy & Adam Storeygard & Deborah Balk & John L. Gittleman & Peter Daszak, 2008. "Global trends in emerging infectious diseases," Nature, Nature, vol. 451(7181), pages 990-993, February.
    3. A. Lee & A. Szpiro & S.Y. Kim & L. Sheppard, 2015. "Impact of preferential sampling on exposure prediction and health effect inference in the context of air pollution epidemiology," Environmetrics, John Wiley & Sons, Ltd., vol. 26(4), pages 255-267, June.
    4. Ho, Lai Ping & Stoyan, D., 2008. "Modelling marked point patterns by intensity-marked Cox processes," Statistics & Probability Letters, Elsevier, vol. 78(10), pages 1194-1199, August.
    5. Finn Lindgren & Håvard Rue & Johan Lindström, 2011. "An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(4), pages 423-498, September.
    6. Duncan Lee & Claire Ferguson & E. Marian Scott, 2011. "Constructing representative air quality indicators with measures of uncertainty," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 174(1), pages 109-126, January.
    7. Alan E. Gelfand & Sujit K. Sahu & David M. Holland, 2012. "On the effect of preferential sampling in spatial prediction," Environmetrics, John Wiley & Sons, Ltd., vol. 23(7), pages 565-578, November.
    8. Peter Diggle & Sara Morris & Paul Elliott & Gavin Shaddick, 1997. "Regression Modelling of Disease Risk in Relation to Point Sources," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 491-505, September.
    9. Peter J. Diggle & Raquel Menezes & Ting‐li Su, 2010. "Geostatistical inference under preferential sampling," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 59(2), pages 191-232, March.
    10. D. Pati & B. J. Reich & D. B. Dunson, 2011. "Bayesian geostatistical modelling with informative sampling locations," Biometrika, Biometrika Trust, vol. 98(1), pages 35-48.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lucia Paci & Alan E. Gelfand & and María Asunción Beamonte & Pilar Gargallo & Manuel Salvador, 2020. "Spatial hedonic modelling adjusted for preferential sampling," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 169-192, January.
    2. Erin M. Schliep & Christopher K. Wikle & Ranadeep Daw, 2023. "Correcting for informative sampling in spatial covariance estimation and kriging predictions," Journal of Geographical Systems, Springer, vol. 25(4), pages 587-613, October.
    3. Brian J. Reich & Shu Yang & Yawen Guan & Andrew B. Giffin & Matthew J. Miller & Ana Rappold, 2021. "A Review of Spatial Causal Inference Methods for Environmental and Epidemiological Applications," International Statistical Review, International Statistical Institute, vol. 89(3), pages 605-634, December.
    4. C. Forlani & S. Bhatt & M. Cameletti & E. Krainski & M. Blangiardo, 2020. "A joint Bayesian space–time model to integrate spatially misaligned air pollution data in R‐INLA," Environmetrics, John Wiley & Sons, Ltd., vol. 31(8), December.
    5. Brian J. Reich & Howard H. Chang & Kristen M. Foley, 2014. "A spectral method for spatial downscaling," Biometrics, The International Biometric Society, vol. 70(4), pages 932-942, December.
    6. Samira Zahmatkesh & Mohsen Mohammadzadeh, 2021. "Bayesian prediction of spatial data with non-ignorable missingness," Statistical Papers, Springer, vol. 62(5), pages 2247-2268, October.
    7. Zhang, Shen & Liu, Xin & Tang, Jinjun & Cheng, Shaowu & Qi, Yong & Wang, Yinhai, 2018. "Spatio-temporal modeling of destination choice behavior through the Bayesian hierarchical approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 537-551.
    8. John M. Humphreys & Robert B. Srygley & David H. Branson, 2022. "Geographic Variation in Migratory Grasshopper Recruitment under Projected Climate Change," Geographies, MDPI, vol. 2(1), pages 1-19, January.
    9. Johnston, Alison & Moran, Nick & Musgrove, Andy & Fink, Daniel & Baillie, Stephen R., 2020. "Estimating species distributions from spatially biased citizen science data," Ecological Modelling, Elsevier, vol. 422(C).
    10. Kyung-Duk Min & Ho-Jang Kwon & KyooSang Kim & Sun-Young Kim, 2017. "Air Pollution Monitoring Design for Epidemiological Application in a Densely Populated City," IJERPH, MDPI, vol. 14(7), pages 1-12, June.
    11. Martins, Thiago G. & Simpson, Daniel & Lindgren, Finn & Rue, Håvard, 2013. "Bayesian computing with INLA: New features," Computational Statistics & Data Analysis, Elsevier, vol. 67(C), pages 68-83.
    12. Yi Liu & Gavin Shaddick & James V. Zidek, 2017. "Incorporating High-Dimensional Exposure Modelling into Studies of Air Pollution and Health," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 559-581, December.
    13. Bivand, Roger & Krivoruchko, Konstantin, 2018. "Big data sampling and spatial analysis: “which of the two ladles, of fig-wood or gold, is appropriate to the soup and the pot?”," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 87-91.
    14. Rajala, T. & Penttinen, A., 2014. "Bayesian analysis of a Gibbs hard-core point pattern model with varying repulsion range," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 530-541.
    15. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    16. Nikolett Orosz & Tünde Tóthné Tóth & Gyöngyi Vargáné Gyuró & Zsoltné Tibor Nábrádi & Klára Hegedűsné Sorosi & Zsuzsa Nagy & Éva Rigó & Ádám Kaposi & Gabriella Gömöri & Cornelia Melinda Adi Santoso & A, 2022. "Comparison of Length of Hospital Stay for Community-Acquired Infections Due to Enteric Pathogens, Influenza Viruses and Multidrug-Resistant Bacteria: A Cross-Sectional Study in Hungary," IJERPH, MDPI, vol. 19(23), pages 1-16, November.
    17. Laura M. Sangalli, 2021. "Spatial Regression With Partial Differential Equation Regularisation," International Statistical Review, International Statistical Institute, vol. 89(3), pages 505-531, December.
    18. Mudassar Arsalan & Omar Mubin & Fady Alnajjar & Belal Alsinglawi, 2020. "COVID-19 Global Risk: Expectation vs. Reality," IJERPH, MDPI, vol. 17(15), pages 1-10, August.
    19. Matthias Katzfuss & Joseph Guinness & Wenlong Gong & Daniel Zilber, 2020. "Vecchia Approximations of Gaussian-Process Predictions," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 25(3), pages 383-414, September.
    20. Finn Lindgren, 2015. "Comments on: Comparing and selecting spatial predictors using local criteria," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 35-44, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jagbes:v:28:y:2023:i:3:d:10.1007_s13253-023-00535-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.