IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0231935.html
   My bibliography  Save this article

A Bayesian unified framework for risk estimation and cluster identification in small area health data analysis

Author

Listed:
  • K C Flórez
  • A Corberán-Vallet
  • A Iftimi
  • J D Bermúdez

Abstract

Many statistical models have been proposed to analyse small area disease data with the aim of describing spatial variation in disease risk. In this paper, we propose a Bayesian hierarchical model that simultaneously allows for risk estimation and cluster identification. Our model formulation assumes that there is an unknown number of risk classes and small areas are assigned to a risk class by means of independent allocation variables. Therefore, areas within each cluster are assumed to share a common risk but they may be geographically separated. The posterior distribution of the parameter representing the number of risk classes is estimated using a novel procedure that combines its prior distribution with an efficient estimate of the marginal likelihood of the data given this parameter. An extension of the model incorporating covariates is also shown. These covariates may incorporate additional information on the problem or they may account for spatial correlation in the data. We illustrate the performance of the proposed model through both a simulation study and a case study of reported cases of varicella in the city of Valencia, Spain.

Suggested Citation

  • K C Flórez & A Corberán-Vallet & A Iftimi & J D Bermúdez, 2020. "A Bayesian unified framework for risk estimation and cluster identification in small area health data analysis," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-17, May.
  • Handle: RePEc:plo:pone00:0231935
    DOI: 10.1371/journal.pone.0231935
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0231935
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0231935&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0231935?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Leonhard Knorr-Held & Günter Raßer, 2000. "Bayesian Detection of Clusters and Discontinuities in Disease Maps," Biometrics, The International Biometric Society, vol. 56(1), pages 13-21, March.
    2. Ronald E. Gangnon & Murray K. Clayton, 2000. "Bayesian Detection and Modeling of Spatial Disease Clustering," Biometrics, The International Biometric Society, vol. 56(3), pages 922-935, September.
    3. Aris Perperoglou & Paul Eilers, 2010. "Penalized regression with individual deviance effects," Computational Statistics, Springer, vol. 25(2), pages 341-361, June.
    4. Kazem Nasserinejad & Joost van Rosmalen & Wim de Kort & Emmanuel Lesaffre, 2017. "Comparison of Criteria for Choosing the Number of Classes in Bayesian Finite Mixture Models," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-23, January.
    5. D. G. T. Denison & C. C. Holmes, 2001. "Bayesian Partitioning for Estimating Disease Risk," Biometrics, The International Biometric Society, vol. 57(1), pages 143-149, March.
    6. Julian Besag & Jeremy York & Annie Mollié, 1991. "Bayesian image restoration, with two applications in spatial statistics," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 43(1), pages 1-20, March.
    7. Green P.J. & Richardson S., 2002. "Hidden Markov Models and Disease Mapping," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1055-1070, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Minge Xie & Qiankun Sun & Joseph Naus, 2009. "A Latent Model to Detect Multiple Clusters of Varying Sizes," Biometrics, The International Biometric Society, vol. 65(4), pages 1011-1020, December.
    2. Goepp, Vivien & van de Kassteele, Jan, 2024. "Graph-based spatial segmentation of areal data," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    3. Leonhard Knorr-Held & Günter Raßer & Nikolaus Becker, 2002. "Disease Mapping of Stage-Specific Cancer Incidence Data," Biometrics, The International Biometric Society, vol. 58(3), pages 492-501, September.
    4. Douglas R. M. Azevedo & Marcos O. Prates & Dipankar Bandyopadhyay, 2021. "MSPOCK: Alleviating Spatial Confounding in Multivariate Disease Mapping Models," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(3), pages 464-491, September.
    5. Håvard Rue & Ingelin Steinsland & Sveinung Erland, 2004. "Approximating hidden Gaussian Markov random fields," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 66(4), pages 877-892, November.
    6. Ugarte, M.D. & Goicoa, T. & Militino, A.F., 2009. "Empirical Bayes and Fully Bayes procedures to detect high-risk areas in disease mapping," Computational Statistics & Data Analysis, Elsevier, vol. 53(8), pages 2938-2949, June.
    7. Vidal Rodeiro, Carmen L. & Lawson, Andrew B., 2005. "An evaluation of the edge effects in disease map modelling," Computational Statistics & Data Analysis, Elsevier, vol. 49(1), pages 45-62, April.
    8. Shota Homma & Daisuke Murakami & Shinya Hosokawa & Koji Kanefuji, 2025. "Introduction risk of fire ants through container cargo in ports: Data integration approach considering a logistic network," PLOS ONE, Public Library of Science, vol. 20(2), pages 1-15, February.
    9. Congdon, Peter, 2007. "Mixtures of spatial and unstructured effects for spatially discontinuous health outcomes," Computational Statistics & Data Analysis, Elsevier, vol. 51(6), pages 3197-3212, March.
    10. D. G. T. Denison & C. C. Holmes, 2001. "Bayesian Partitioning for Estimating Disease Risk," Biometrics, The International Biometric Society, vol. 57(1), pages 143-149, March.
    11. Marco Alfò & Cecilia Vitiello, 2003. "Finite mixtures approach to ecological regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 12(1), pages 93-108, February.
    12. Duncan Lee & Richard Mitchell, 2013. "Locally adaptive spatial smoothing using conditional auto-regressive models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(4), pages 593-608, August.
    13. Miklos Arato, N. & Dryden, Ian L. & Taylor, Charles C., 2006. "Hierarchical Bayesian modelling of spatial age-dependent mortality," Computational Statistics & Data Analysis, Elsevier, vol. 51(2), pages 1347-1363, November.
    14. Dolores Catelan & Annibale Biggeri & Corrado Lagazio, 2009. "On the clustering term in ecological analysis: how do different prior specifications affect results?," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 18(1), pages 49-61, March.
    15. Deborah A. Costain, 2009. "Bayesian Partitioning for Modeling and Mapping Spatial Case–Control Data," Biometrics, The International Biometric Society, vol. 65(4), pages 1123-1132, December.
    16. repec:jss:jstsof:36:i10 is not listed on IDEAS
    17. Katie Wilson & Jon Wakefield, 2022. "A probabilistic model for analyzing summary birth history data," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 47(11), pages 291-344.
    18. Eibich, Peter & Ziebarth, Nicolas, 2014. "Examining the Structure of Spatial Health Effects in Germany Using Hierarchical Bayes Models," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 49, pages 305-320.
    19. Mayer Alvo & Jingrui Mu, 2023. "COVID-19 Data Analysis Using Bayesian Models and Nonparametric Geostatistical Models," Mathematics, MDPI, vol. 11(6), pages 1-13, March.
    20. Spezia, L. & Cooksley, S.L. & Brewer, M.J. & Donnelly, D. & Tree, A., 2014. "Modelling species abundance in a river by Negative Binomial hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 599-614.
    21. Zhengyi Zhou & David S. Matteson & Dawn B. Woodard & Shane G. Henderson & Athanasios C. Micheas, 2015. "A Spatio-Temporal Point Process Model for Ambulance Demand," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 6-15, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0231935. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.