IDEAS home Printed from https://ideas.repec.org/a/bla/jorssa/v185y2022i1p156-177.html

Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Māori population in New Zealand

Author

Listed:
  • Peter G. M. van der Heijden
  • Maarten Cruyff
  • Paul A. Smith
  • Christine Bycroft
  • Patrick Graham
  • Nathaniel Matheson‐Dunning

Abstract

We investigate the use of two or more linked lists, for both population size estimation and the relationship between variables appearing on all or only some lists. This relationship is usually not fully known because some individuals appear in only some lists, and some are not in any list. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four lists (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more lists have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some lists cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each list as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.
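As a rough illustration of the idea underlying multiple system estimation, the sketch below computes the classical two-list (dual-system) estimate of population size under the assumption that inclusion in one list is independent of inclusion in the other. This is only the simplest special case, not the four-list latent class model described in the abstract; the function name `dual_system_estimate` and the counts are hypothetical and are not taken from the article.

```python
def dual_system_estimate(n_both: int, n_only_a: int, n_only_b: int) -> dict:
    """Two-list population size estimate, assuming the lists are independent.

    Hypothetical helper for illustration only; not the article's method.
    """
    if n_both <= 0:
        raise ValueError("need at least one individual observed on both lists")
    # Under independence the odds ratio of the 2x2 inclusion table is 1,
    # so the unobserved cell (on neither list) is n_only_a * n_only_b / n_both.
    n_missed = n_only_a * n_only_b / n_both
    n_observed = n_both + n_only_a + n_only_b
    return {"missed": n_missed, "total": n_observed + n_missed}


# Made-up counts for illustration, not data from the article.
result = dual_system_estimate(n_both=600, n_only_a=300, n_only_b=100)
print(result)
```

With more than two lists, the same logic is usually expressed as a log-linear model fitted to the observed cells of the inclusion table, which allows some dependence between lists to be modelled rather than assumed away.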

Suggested Citation

  • Peter G. M. van der Heijden & Maarten Cruyff & Paul A. Smith & Christine Bycroft & Patrick Graham & Nathaniel Matheson‐Dunning, 2022. "Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Māori population in New Zealand," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 156-177, January.
  • Handle: RePEc:bla:jorssa:v:185:y:2022:i:1:p:156-177
    DOI: 10.1111/rssa.12731

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssa.12731
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ton de Waal & Arnout van Delden & Sander Scholtus, 2020. "Multi‐source Statistics: Basic Situations and Methods," International Statistical Review, International Statistical Institute, vol. 88(1), pages 203-228, April.
    2. Fiona Shalley & Kalinda Griffiths & Tom Wilson, 2023. "No Longer Indigenous," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(4), pages 1-27, August.
    3. Albert Sabater & Gemma Catney, 2019. "Unpacking Summary Measures of Ethnic Residential Segregation Using an Age Group and Age Cohort Perspective," European Journal of Population, Springer;European Association for Population Studies, vol. 35(1), pages 161-189, February.
    4. Jack Lothian & Anders Holmberg & Allyson Seyb, 2019. "An Evolutionary Schema for Using “it-is-what-it-is” Data in Official Statistics," Journal of Official Statistics, Sciendo, vol. 35(1), pages 137-165, March.
    5. Jonas F. Schenkel & Li‐Chun Zhang, 2022. "Adjusting misclassification using a second classifier with an external validation sample," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1882-1902, October.
    6. Zrinka Marušić & Marijana Kožul & Ivana Brozović, 2020. "Measuring non-commercial tourism traffic in Croatia: Challenges of using administrative data," Croatian Review of Economic, Business and Social Statistics, Sciendo, vol. 6(2), pages 69-81, December.
    7. Davide Di Cecco & Marco Di Zio & Danila Filipponi & Irene Rocchetti, 2018. "Population Size Estimation Using Multiple Incomplete Lists with Overcoverage," Journal of Official Statistics, Sciendo, vol. 34(2), pages 557-572, June.
    8. Paul Labonne & Martin Weale, 2020. "Temporal disaggregation of overlapping noisy quarterly data: estimation of monthly output from UK value‐added tax data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(3), pages 1211-1230, June.
    9. Stephanie Coffey & Jaya Damineni & John Eltinge & Anup Mathur & Kayla Varela & Allison Zotti, 2023. "Some Open Questions on Multiple-Source Extensions of Adaptive-Survey Design Concepts and Methods," Working Papers 23-03, Center for Economic Studies, U.S. Census Bureau.
    10. Shira Mitchell & Al Ozonoff & Alan M. Zaslavsky & Bethany Hedt-Gauthier & Kristian Lum & Brent A. Coull, 2013. "A Comparison of Marginal and Conditional Models for Capture–Recapture Data with Application to Human Rights Violations Data," Biometrics, The International Biometric Society, vol. 69(4), pages 1022-1032, December.
    11. Peter G. M. van der Heijden & Paul A. Smith & Maarten Cruyff & Bart Bakker, 2018. "An Overview of Population Size Estimation where Linking Registers Results in Incomplete Covariates, with an Application to Mode of Transport of Serious Road Casualties," Journal of Official Statistics, Sciendo, vol. 34(1), pages 239-263, March.
    12. Sergej Gričar & Tea Baldigara, 2019. "An explorative study of tourism time series: Evidence from Slovenia and Croatia," Croatian Review of Economic, Business and Social Statistics, Sciendo, vol. 5(2), pages 101-116, December.
    13. Serena Pattaro & Nick Bailey & Chris Dibben, 2020. "Using Linked Longitudinal Administrative Data to Identify Social Disadvantage," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 147(3), pages 865-895, February.
    14. Bernard Baffour & James J. Brown & Peter W. F. Smith, 2021. "Latent Class Analysis for Estimating an Unknown Population Size – with Application to Censuses," Journal of Official Statistics, Sciendo, vol. 37(3), pages 673-697, September.
    15. Jae‐Kwang Kim & Siu‐Ming Tam, 2021. "Data Integration by Combining Big Data and Survey Sample Data for Finite Population Inference," International Statistical Review, International Statistical Institute, vol. 89(2), pages 382-401, August.
    16. L. Boeschoten & M. A. Croon & D. L. Oberski, 2019. "A Note on Applying the BCH Method Under Linear Equality and Inequality Constraints," Journal of Classification, Springer;The Classification Society, vol. 36(3), pages 566-575, October.
    17. James Jackson & Robin Mitra & Brian Francis & Iain Dove, 2022. "Using saturated count models for user‐friendly synthesis of large confidential administrative databases," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 1613-1643, October.
    18. Danilo Fegatelli & Luca Tardella, 2013. "Improved inference on capture recapture models with behavioural effects," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(1), pages 45-66, March.
    19. Christopher W. N. Saville, 2020. "Mental health consequences of minority political positions: The case of brexit," Social Science & Medicine, Elsevier, vol. 258(C).
    20. Li-Chun Zhang, 2019. "A Note on Dual System Population Size Estimator," Journal of Official Statistics, Sciendo, vol. 35(1), pages 279-283, March.
