IDEAS home Printed from https://ideas.repec.org/a/vrs/offsta/v31y2015i3p415-429n5.html
   My bibliography  Save this article

Coverage Evaluation on Probabilistically Linked Data

Author

Listed:
  • Di Consiglio Loredana
  • Tuoto Tiziana

    (Italian National Statistical Institute - Istat, Via Cesare Balbo, 16 00184 Rome, Italy)

Abstract

The Capture-recapture method is a well-known solution for evaluating the unknown size of a population. Administrative data represent sources of independent counts of a population and can be jointly exploited for applying the capture-recapture method. Of course, administrative sources are affected by over- or undercoverage when considered separately. The standard Petersen approach is based on strong assumptions, including perfect record linkage between lists. In reality, record linkage results can be affected by errors. A simple method for achieving linkage error-unbiased population total estimates is proposed in Ding and Fienberg (1994). In this article, an extension of the Ding and Fienberg model by relaxing their conditions is proposed. The procedures are illustrated for estimating the total number of road casualties, on the basis of a probabilistic record linkage between two administrative data sources. Moreover, a simulation study is developed, providing evidence that the adjusted estimator always performs better than the Petersen estimator.

Suggested Citation

  • Di Consiglio Loredana & Tuoto Tiziana, 2015. "Coverage Evaluation on Probabilistically Linked Data," Journal of Official Statistics, Sciendo, vol. 31(3), pages 415-429, September.
  • Handle: RePEc:vrs:offsta:v:31:y:2015:i:3:p:415-429:n:5
    DOI: 10.1515/jos-2015-0025
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jos-2015-0025
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jos-2015-0025?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bartolucci, Francesco & Forcina, Antonio, 2006. "A Class of Latent Marginal Models for CaptureRecapture Data With Continuous Covariates," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 786-794, June.
    2. Chen Z. & Kuo L., 2001. "A Note on the Estimation of the Multinomial Logit Model With Random Effects," The American Statistician, American Statistical Association, vol. 55, pages 89-95, May.
    3. Brent A. Coull & Alan Agresti, 1999. "The Use of Mixed Logit Models to Reflect Heterogeneity in Capture-Recapture Studies," Biometrics, The International Biometric Society, vol. 55(1), pages 294-301, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Di Consiglio Loredana & Tuoto Tiziana, 2018. "Population Size Estimation and Linkage Errors: the Multiple Lists Case," Journal of Official Statistics, Sciendo, vol. 34(4), pages 889-908, December.
    2. Zhang Li-Chun, 2019. "A Note on Dual System Population Size Estimator," Journal of Official Statistics, Sciendo, vol. 35(1), pages 279-283, March.
    3. Bijak Jakub & Bryant Johan & Gołata Elżbieta & Smallwood Steve, 2021. "Preface," Journal of Official Statistics, Sciendo, vol. 37(3), pages 533-541, September.
    4. de Wolf Peter-Paul & van der Laan Jan & Zult Daan, 2019. "Connecting Correction Methods for Linkage Error in Capture-Recapture," Journal of Official Statistics, Sciendo, vol. 35(3), pages 577-597, September.
    5. Ton de Waal & Arnout van Delden & Sander Scholtus, 2020. "Multi‐source Statistics: Basic Situations and Methods," International Statistical Review, International Statistical Institute, vol. 88(1), pages 203-228, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alessio Farcomeni, 2015. "Latent class recapture models with flexible behavioural response," Statistica, Department of Statistics, University of Bologna, vol. 75(1), pages 5-17.
    2. Francesco Bartolucci & Fulvia Pennoni, 2007. "A Class of Latent Markov Models for Capture–Recapture Data Allowing for Time, Heterogeneity, and Behavior Effects," Biometrics, The International Biometric Society, vol. 63(2), pages 568-578, June.
    3. Danilo Alunni Fegatelli & Luca Tardella, 2016. "Flexible behavioral capture–recapture modeling," Biometrics, The International Biometric Society, vol. 72(1), pages 125-135, March.
    4. Di Consiglio Loredana & Tuoto Tiziana, 2018. "Population Size Estimation and Linkage Errors: the Multiple Lists Case," Journal of Official Statistics, Sciendo, vol. 34(4), pages 889-908, December.
    5. Shira Mitchell & Al Ozonoff & Alan M. Zaslavsky & Bethany Hedt-Gauthier & Kristian Lum & Brent A. Coull, 2013. "A Comparison of Marginal and Conditional Models for Capture–Recapture Data with Application to Human Rights Violations Data," Biometrics, The International Biometric Society, vol. 69(4), pages 1022-1032, December.
    6. Baffour Bernard & Brown James J. & Smith Peter W.F., 2021. "Latent Class Analysis for Estimating an Unknown Population Size – with Application to Censuses," Journal of Official Statistics, Sciendo, vol. 37(3), pages 673-697, September.
    7. Nikolaj Malchow-Møller & Michael Svarer, 2003. "Estimation of the multinomial logit model with random effects," Applied Economics Letters, Taylor & Francis Journals, vol. 10(7), pages 389-392.
    8. Mevin B. Hooten & Michael R. Schwob & Devin S. Johnson & Jacob S. Ivan, 2023. "Multistage hierarchical capture–recapture models," Environmetrics, John Wiley & Sons, Ltd., vol. 34(6), September.
    9. Brent A. Coull & Alan Agresti, 2000. "Random Effects Modeling of Multiple Binomial Responses Using the Multivariate Binomial Logit-Normal Distribution," Biometrics, The International Biometric Society, vol. 56(1), pages 73-80, March.
    10. Ben C. Stevenson & Rachel M. Fewster & Koustubh Sharma, 2022. "Spatial correlation structures for detections of individuals in spatial capture–recapture models," Biometrics, The International Biometric Society, vol. 78(3), pages 963-973, September.
    11. Forcina, Antonio, 2008. "Identifiability of extended latent class models with individual covariates," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5263-5268, August.
    12. Francesco Bartolucci & Antonio Forcina, 2001. "Analysis of Capture-Recapture Data with a Rasch-Type Model Allowing for Conditional Dependence and Multidimensionality," Biometrics, The International Biometric Society, vol. 57(3), pages 714-719, September.
    13. J. Andrew Royle, 2009. "Analysis of Capture–Recapture Models with Individual Covariates Using Data Augmentation," Biometrics, The International Biometric Society, vol. 65(1), pages 267-274, March.
    14. Dardanoni, V & Li Donni, P, 2008. "Testing For Asymmetric Information In Insurance Markets With Unobservable Types," Health, Econometrics and Data Group (HEDG) Working Papers 08/26, HEDG, c/o Department of Economics, University of York.
    15. Janne Petersen & Karen Bandeen-Roche & Esben Budtz-Jørgensen & Klaus Groes Larsen, 2012. "Predicting Latent Class Scores for Subsequent Analysis," Psychometrika, Springer;The Psychometric Society, vol. 77(2), pages 244-262, April.
    16. Bacci, Silvia & Bartolucci, Francesco & Pieroni, Luca, 2012. "A causal analysis of mother’s education on birth inequalities," MPRA Paper 38754, University Library of Munich, Germany.
    17. Jennifer B Smith & Bryan S Stevens & Dwayne R Etter & David M Williams, 2020. "Performance of spatial capture-recapture models with repurposed data: Assessing estimator robustness for retrospective applications," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-16, August.
    18. Louis-Paul Rivest & Sophie Baillargeon, 2007. "Applications and Extensions of Chao's Moment Estimator for the Size of a Closed Population," Biometrics, The International Biometric Society, vol. 63(4), pages 999-1006, December.
    19. Ali Hortacsu & Olivia R. Natan & Hayden Parsley & Timothy Schwieg & Kevin R. Williams, 2021. "Incorporating Search and Sales Information in Demand Estimation," Cowles Foundation Discussion Papers 2313R1, Cowles Foundation for Research in Economics, Yale University, revised Mar 2023.
    20. J. Andrew Royle, 2006. "Site Occupancy Models with Heterogeneous Detection Probabilities," Biometrics, The International Biometric Society, vol. 62(1), pages 97-102, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:vrs:offsta:v:31:y:2015:i:3:p:415-429:n:5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.sciendo.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.