IDEAS home Printed from https://ideas.repec.org/a/sae/envirb/v51y2024i2p553-562.html

Synthetic population data for small area estimation in the United States

Author

Listed:
  • Yue Lin

Abstract

Small area estimation is critical for a wide range of applications, including urban planning, funding distribution, and policy formulation. Individual-level population data, which typically include each individual’s socio-demographic characteristics and small area location, are a rich source of information for small area estimation. However, individual-level population data are often not made public due to confidentiality concerns. This paper describes the development of a public-use synthetic individual-level population dataset in the United States that can be useful for small area estimation. This dataset contains characteristics of housing type, age, sex, race, and Hispanic or Latino origin for all 308,745,538 individuals in the United States at the census block group level, based on publicly available aggregated data from the 2010 Census. Experimental results suggest the validity of the synthetic data by comparing it to different data sources, and we show examples of how this dataset can be used in small area estimation.

Suggested Citation

  • Yue Lin, 2024. "Synthetic population data for small area estimation in the United States," Environment and Planning B, , vol. 51(2), pages 553-562, February.
  • Handle: RePEc:sae:envirb:v:51:y:2024:i:2:p:553-562
    DOI: 10.1177/23998083231215825
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/23998083231215825
    Download Restriction: no

    File URL: https://libkey.io/10.1177/23998083231215825?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robin Lovelace & Mark Birkin & Dimitris Ballas & Eveline van Leeuwen, 2015. "Evaluating the Performance of Iterative Proportional Fitting for Spatial Microsimulation: New Tests for an Established Technique," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 18(2), pages 1-21.
    2. Robert Tanton & Yogi Vidyattama & Justine McNamara & Quoc Ngu Vu & Ann Harding, 2009. "Old, Single and Poor: Using Microsimulation and Microdata to Analyse Poverty and the Impact of Policy Change among Older Australians," Economic Papers, The Economic Society of Australia, vol. 28(2), pages 102-120, June.
    3. Yue Lin & Ningchuan Xiao, 2023. "Assessing the Impact of Differential Privacy on Population Uniques in Geographically Aggregated Data: The Case of the 2020 U.S. Census," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(5), pages 1-20, October.
    4. Spooner, Fiona & Abrams, Jesse F. & Morrissey, Karyn & Shaddick, Gavin & Batty, Michael & Milton, Richard & Dennett, Adam & Lomax, Nik & Malleson, Nick & Nelissen, Natalie & Coleman, Alex & Nur, Jamil, 2021. "A dynamic microsimulation model for epidemics," Social Science & Medicine, Elsevier, vol. 291(C).
    5. Farooq, Bilal & Bierlaire, Michel & Hurtubia, Ricardo & Flötteröd, Gunnar, 2013. "Simulation based population synthesis," Transportation Research Part B: Methodological, Elsevier, vol. 58(C), pages 243-263.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abdoul Razac Sané & Pierre-Olivier Vandanjon & Rachid Belaroussi & Pierre Hankach, 2025. "A comprehensive investigation of variational auto-encoders for population synthesis," Journal of Computational Social Science, Springer, vol. 8(1), pages 1-34, February.
    2. Xiong Linping & Zhang Lulu & Tang Weidong & Liu Hong, 2010. "Evaluating Sustainability of Medical Insurance Scheme for Urban Employed Individuals in China," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 35(4), pages 600-625, October.
    3. Martin Johnsen & Oliver Brandt & Sergio Garrido & Francisco C. Pereira, 2020. "Population synthesis for urban resident modeling using deep generative models," Papers 2011.06851, arXiv.org.
    4. Rosalind Wallace & Rachel Franklin & Susan Grant-Muller & Alison Heppenstall & Victoria Houlden, 2022. "Estimating the social and spatial impacts of Covid mitigation strategies in United Kingdom regions: synthetic data and dashboards [Developing a sustainable exit strategy for COVID-19: health, economic and public policy implications]," Cambridge Journal of Regions, Economy and Society, Cambridge Political Economy Society, vol. 15(3), pages 683-702.
    5. Rachid Belaroussi & Younes Delhoum, 2024. "Forecasting Daily Activity Plans of a Synthetic Population in an Upcoming District," Forecasting, MDPI, vol. 6(2), pages 1-26, May.
    6. Nick Malleson & Rachel Franklin & Daniel Arribas-Bel & Tao Cheng & Mark Birkin, 2024. "Digital twins on trial: Can they actually solve wicked societal problems and change the world for better?," Environment and Planning B, , vol. 51(6), pages 1181-1186, July.
    7. Robert Tanton, 2014. "A Review of Spatial Microsimulation Methods," International Journal of Microsimulation, International Microsimulation Association, vol. 7(1), pages 4-25.
    8. Itismita Mohanty & Robert Tanton & Yogi Vidyattama & Marcia Keegan & Robert Cummins, 2013. "‘Small area estimates of Subjective Wellbeing: Spatial Microsimulation on the Australian Unity Wellbeing Index Survey’," NATSEM Working Paper Series 13/23, University of Canberra, National Centre for Social and Economic Modelling.
    9. Ricardo Hurtubia & Francisco Javier Martinez & Michel Bierlaire, 2019. "A quasi-equilibrium approach for market clearing in land use microsimulations," Environment and Planning B, , vol. 46(3), pages 445-468, March.
    10. Mohamed Khachman & Catherine Morency & Francesco Ciari, 2024. "Integrated multiresolution framework for spatialized population synthesis," Transportation, Springer, vol. 51(3), pages 823-852, June.
    11. Nicholas Fournier & Eleni Christofa & Arun Prakash Akkinepally & Carlos Lima Azevedo, 2021. "Integrated population synthesis and workplace assignment using an efficient optimization-based person-household matching method," Transportation, Springer, vol. 48(2), pages 1061-1087, April.
    12. Ann Harding & Robert Tanton, 2014. "Policy and people at the small-area level: using micro-simulation to create synthetic spatial data," Chapters, in: Robert Stimson (ed.), Handbook of Research Methods and Applications in Spatially Integrated Social Science, chapter 25, pages 560-586, Edward Elgar Publishing.
    13. ANTONI Jean-Philippe & VUIDEL Gilles & KLEIN Olivier, 2017. "Generating a located synthetic population of individuals, households, and dwellings," LISER Working Paper Series 2017-07, Luxembourg Institute of Socio-Economic Research (LISER).
    14. Melvin Wong & Bilal Farooq, 2019. "Information processing constraints in travel behaviour modelling: A generative learning approach," Papers 1907.07036, arXiv.org, revised Jul 2019.
    15. Ricardo Crespo & Ignacio Hernandez, 2020. "On the spatially explicit Gini coefficient: the case study of Chile—a high-income developing country," Letters in Spatial and Resource Sciences, Springer, vol. 13(1), pages 37-47, April.
    16. Vo, Khoa D. & Kim, Eui-Jin & Bansal, Prateek, 2025. "A novel data fusion method to leverage passively-collected mobility data in generating spatially-heterogeneous synthetic population," Transportation Research Part B: Methodological, Elsevier, vol. 191(C).
    17. Verbist, Gerlinde & Goedemé, Tim & Van den Bosch, Karel & Salanauskaite, Lina, 2013. "Testing the statistical significance of microsimulation results: often easier than you think. A technical note," EUROMOD Working Papers EM18/13, EUROMOD at the Institute for Social and Economic Research.
    18. Chen, Xiaowei & Hamim, Omar Faruqe & Moras, Bruno Cesar Krause & Gkritza, Konstantina & Ukkusuri, Satish V., 2025. "Generative AI-driven framework for estimating future electric vehicle usage," Applied Energy, Elsevier, vol. 401(PB).
    19. Tang, Justin Hayse Chiwing G. & Liu, Junbei & Chen, Anthony & Wang, Bobin & Zhuge, Chengxiang & Yang, Xiong, 2025. "Exploring the potential adoption of Mobility-as-a-Service in Beijing: A spatial agent-based model," Transportation Research Part A: Policy and Practice, Elsevier, vol. 194(C).
    20. Spooner, Fiona & Abrams, Jesse F. & Morrissey, Karyn & Shaddick, Gavin & Batty, Michael & Milton, Richard & Dennett, Adam & Lomax, Nik & Malleson, Nick & Nelissen, Natalie & Coleman, Alex & Nur, Jamil, 2021. "A dynamic microsimulation model for epidemics," Social Science & Medicine, Elsevier, vol. 291(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:envirb:v:51:y:2024:i:2:p:553-562. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.