IDEAS home Printed from https://ideas.repec.org/a/kap/poprpr/v41y2022i6d10.1007_s11113-022-09734-2.html
   My bibliography  Save this article

Reconstruction of age distributions from differentially private census data

Author

Listed:
  • Sigurd Dyrting

    (Northern Institute, Charles Darwin University)

  • Abraham Flaxman

    (University of Washington)

  • Ethan Sharygin

    (Portland State University)

Abstract

The age distribution of a population is important for understanding the demand and provision of labor and services, and as a denominator for calculating key age-specific rates such as fertility and mortality. In the US, the most important source of information on age distributions is the decennial census, but a new disclosure avoidance system (DAS) based on differential privacy will inject noise into the data, potentially compromising its utility for small areas and minority populations. In this paper, we explore the question whether there are statistical methods that can be applied to noisy age distributions to enhance the research uses of census data without compromising privacy. We apply a non-parametric method for smoothing with naive or informative priors to age distributions from the 2010 Census via demonstration data which have had the US Census Bureau’s implementation of differential privacy applied. We find that smoothing age distributions can increase the fidelity of the demonstration data to previously published population counts by age. We discuss implications for uses of data from the 2020 US Census and potential consequences for the measurement of population dynamics, health, and disparities.

Suggested Citation

  • Sigurd Dyrting & Abraham Flaxman & Ethan Sharygin, 2022. "Reconstruction of age distributions from differentially private census data," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(6), pages 2311-2329, December.
  • Handle: RePEc:kap:poprpr:v:41:y:2022:i:6:d:10.1007_s11113-022-09734-2
    DOI: 10.1007/s11113-022-09734-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11113-022-09734-2
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11113-022-09734-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Steven Ruggles & Catherine Fitch & Diana Magnuson & Jonathan Schroeder, 2019. "Differential Privacy and Census Data: Implications for Social and Economic Research," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 403-408, May.
    2. Laura McKenna, 2018. "Disclosure Avoidance Techniques Used for the 1970 through 2010 Decennial Censuses of Population and Housing," Working Papers 18-47, Center for Economic Studies, U.S. Census Bureau.
    3. Joop de Beer, 2011. "A new relational method for smoothing and projecting age-specific fertility rates: TOPALS," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 24(18), pages 409-454.
    4. Santos-Lozada, Alexis R & Perez-Rivera, Danilo T & Bhat, Aarti C., 2020. "How differential privacy will affect our understanding of population growth in the United States," SocArXiv pmux7, Center for Open Science.
    5. Sigurd Dyrting, 2020. "Smoothing migration intensities with P-TOPALS," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 43(55), pages 1607-1650.
    6. Alexis R. Santos-Lozada & Jeffrey T. Howard & Ashton M. Verdery, 2020. "How differential privacy will affect our understanding of health disparities in the United States," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 117(24), pages 13405-13412, June.
    7. Joop de Beer, 2012. "Smoothing and projecting age-specific probabilities of death by TOPALS," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 27(20), pages 543-592.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sigurd Dyrting & Andrew Taylor & Tom Wilson, 2024. "Application of P-TOPALS for Smoothing Input Data for Population Projections ‘At the Edge’," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 43(2), pages 1-28, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Steven Ruggles & David Riper, 2022. "The Role of Chance in the Census Bureau Database Reconstruction Experiment," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 781-788, June.
    2. Tom Wilson & Irina Grossman & Monica Alexander & Phil Rees & Jeromey Temple, 2022. "Methods for Small Area Population Forecasts: State-of-the-Art and Research Needs," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(3), pages 865-898, June.
    3. Bernard Baffour & James Raymer, 2019. "Estimating multiregional survivorship probabilities for sparse data: An application to immigrant populations in Australia, 1981–2011," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 40(18), pages 463-502.
    4. Sigurd Dyrting, 2020. "Smoothing migration intensities with P-TOPALS," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 43(55), pages 1607-1650.
    5. Keilman, Nico, 2016. "Household forecasting: Preservation of age patterns," International Journal of Forecasting, Elsevier, vol. 32(3), pages 726-735.
    6. J. Tom Mueller & Alexis R. Santos-Lozada, 2022. "The 2020 US Census Differential Privacy Method Introduces Disproportionate Discrepancies for Rural and Non-White Populations," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(4), pages 1417-1430, August.
    7. Heng Xu & Nan Zhang, 2022. "Implications of Data Anonymization on the Statistical Evidence of Disparity," Management Science, INFORMS, vol. 68(4), pages 2600-2618, April.
    8. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    9. Thayer Alshaabi & David R Dewhurst & James P Bagrow & Peter S Dodds & Christopher M Danforth, 2021. "The sociospatial factors of death: Analyzing effects of geospatially-distributed variables in a Bayesian mortality model for Hong Kong," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-20, March.
    10. James Gaboardi, 2020. "Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance," Working Papers 20-5, Center for Economic Studies, U.S. Census Bureau.
    11. John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Simson Garfinkel & Micah Heineck & Christine Heiss & Robert Johns & Daniel Kifer & Philip Leclerc & Ashwin Machanavajjhala & Brett Moran & William, 2022. "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Papers 2204.08986, arXiv.org.
    12. Richelle L. Winkler & Jaclyn L. Butler & Katherine J. Curtis & David Egan-Robertson, 2022. "Differential Privacy and the Accuracy of County-Level Net Migration Estimates," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(2), pages 417-435, April.
    13. Michler, Jeffrey D. & Josephson, Anna & Kilic, Talip & Murray, Siobhan, 2022. "Privacy protection, measurement error, and the integration of remote sensing and socioeconomic survey data," Journal of Development Economics, Elsevier, vol. 158(C).
    14. Kashnitsky, Ilya, 2023. "Non-survival to pension age in Denmark and Sweden: a sub-national investigation," OSF Preprints y9ke4, Center for Open Science.
    15. John M Abowd & Michael B Hawes, 2022. "Confidentiality Protection in the 2020 US Census of Population and Housing," Papers 2206.03524, arXiv.org, revised Dec 2022.
    16. Sigurd Dyrting & Andrew Taylor & Tom Wilson, 2024. "Application of P-TOPALS for Smoothing Input Data for Population Projections ‘At the Edge’," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 43(2), pages 1-28, April.
    17. Carl P. Schmertmann & Marcos R. Gonzaga, 2018. "Bayesian Estimation of Age-Specific Mortality and Life Expectancy for Small Areas With Defective Vital Records," Demography, Springer;Population Association of America (PAA), vol. 55(4), pages 1363-1388, August.
    18. Arkadiusz Wiśniowski & Peter Smith & Jakub Bijak & James Raymer & Jonathan Forster, 2015. "Bayesian Population Forecasting: Extending the Lee-Carter Method," Demography, Springer;Population Association of America (PAA), vol. 52(3), pages 1035-1059, June.
    19. Beth Jarosz, 2021. "Poisson Distribution: A Model for Estimating Households by Household Size," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 40(2), pages 149-162, April.
    20. James Gaboardi, 2020. "Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance," Working Papers 20-05, Center for Economic Studies, U.S. Census Bureau.

    More about this item

    Keywords

    Census; Privacy; Demography;
    All these keywords.

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:poprpr:v:41:y:2022:i:6:d:10.1007_s11113-022-09734-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.