IDEAS home Printed from https://ideas.repec.org/p/cen/wpaper/20-05.html
   My bibliography  Save this paper

Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance

Author

Listed:
  • James Gaboardi

Abstract

This paper furthers a research agenda for modeling populations along spatial networks and expands upon an empirical analysis to a full U.S. county (Gaboardi, 2019, Ch. 1,2). Specific foci are the necessity of, and methods for, validating and benchmarking spatial data when conducting social science research with aggregated and ambiguous population representations. In order to promote the validation of publicly-available data, access to highly-restricted census microdata was requested, and granted, in order to determine the levels of accuracy and error associated with a network-based population modeling framework. Primary findings reinforce the utility of a novel network allocation method—populated polygons to networks (pp2n) in terms of accuracy, computational complexity, and real runtime (Gaboardi, 2019, Ch. 2). Also, a pseudo-benchmark dataset’s performance against the true census microdata shows promise in modeling populations along networks.

Suggested Citation

  • James Gaboardi, 2020. "Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance," Working Papers 20-05, Center for Economic Studies, U.S. Census Bureau.
  • Handle: RePEc:cen:wpaper:20-05
    as

    Download full text from publisher

    File URL: https://www2.census.gov/ces/wp/2020/CES-WP-20-05.pdf
    File Function: First version, 2020
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. John M. Abowd & Ian M. Schmutte, 2017. "Revisiting the Economics of Privacy: Population Statistics and Confidentiality Protection as Public Goods," Working Papers 17-37, Center for Economic Studies, U.S. Census Bureau.
    2. Seth E Spielman & David C Folch, 2015. "Reducing Uncertainty in the American Community Survey through Data-Driven Regionalization," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-21, February.
    3. Jonathan R. Bradley & Christopher K. Wikle & Scott H. Holan, 2017. "Regionalization of multiscale spatial processes by using a criterion for spatial aggregation error," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 815-832, June.
    4. David C. Folch & Daniel Arribas-Bel & Julia Koschinsky & Seth E. Spielman, 2016. "Spatial Variation in the Quality of American Community Survey Estimates," Demography, Springer;Population Association of America (PAA), vol. 53(5), pages 1535-1554, October.
    5. Seth E. Spielman & Alex Singleton, 2015. "Studying Neighborhoods Using Uncertain Data from the American Community Survey: A Contextual Approach," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 105(5), pages 1003-1025, September.
    6. Steven Ruggles & Catherine Fitch & Diana Magnuson & Jonathan Schroeder, 2019. "Differential Privacy and Census Data: Implications for Social and Economic Research," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 403-408, May.
    7. Amatulli, Giuseppe & Peréz-Cabello, Fernando & de la Riva, Juan, 2007. "Mapping lightning/human-caused wildfires occurrence under ignition point location uncertainty," Ecological Modelling, Elsevier, vol. 200(3), pages 321-333.
    8. James Gaboardi, 2020. "Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance," Working Papers 20-5, Center for Economic Studies, U.S. Census Bureau.
    9. Derek S. Young & Andrew M. Raim & Nancy R. Johnson, 2017. "Zero-inflated modelling for characterizing coverage errors of extracts from the US Census Bureau's Master Address File," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(1), pages 73-97, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. James Gaboardi, 2020. "Validating Abstract Representations of Spatial Population Data while considering Disclosure Avoidance," Working Papers 20-5, Center for Economic Studies, U.S. Census Bureau.
    2. Daniel H. Weinberg & John M. Abowd & Robert F. Belli & Noel Cressie & David C. Folch & Scott H. Holan & Margaret C. Levenstein & Kristen M. Olson & Jerome P. Reiter & Matthew D. Shapiro & Jolene Smyth, 2017. "Effects of a Government-Academic Partnership: Has the NSF-Census Bureau Research Network Helped Improve the U.S. Statistical System?," Working Papers 17-59r, Center for Economic Studies, U.S. Census Bureau.
    3. David C. Folch & Seth Spielman & Molly Graber, 2023. "The Impact of Covariance on American Community Survey Margins of Error: Computational Alternatives," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 42(4), pages 1-23, August.
    4. Raoul S. Liévanos & Amy Lubitow & Julius Alexander McGee, 2019. "Misrecognition in a Sustainability Capital: Race, Representation, and Transportation Survey Response Rates in the Portland Metropolitan Area," Sustainability, MDPI, vol. 11(16), pages 1-33, August.
    5. Arthur Acolin & Ari Decter-Frain & Matt Hall, 2022. "Small-area estimates from consumer trace data," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 47(27), pages 843-882.
    6. Kuter, Semih & Usul, Nurunnisa & Kuter, Nazan, 2011. "Bandwidth determination for kernel density analysis of wildfire events at forest sub-district scale," Ecological Modelling, Elsevier, vol. 222(17), pages 3033-3040.
    7. John M. Abowd & Ian M. Schmutte & William Sexton & Lars Vilhuber, 2019. "Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods," Papers 1906.09353, arXiv.org.
    8. Neil Lee & Andrés Rodríguez-Pose, 2016. "Is There Trickle-Down from Tech? Poverty, Employment, and the High-Technology Multiplier in U.S. Cities," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 106(5), pages 1114-1134, September.
    9. Rodríguez-Pose, Andrés & Lee, Neil, 2020. "Hipsters vs. geeks? Creative workers, STEM and innovation in US cities," LSE Research Online Documents on Economics 103974, London School of Economics and Political Science, LSE Library.
    10. Livio Finos & Fortunato Pesarin, 2020. "On zero-inflated permutation testing and some related problems," Statistical Papers, Springer, vol. 61(5), pages 2157-2174, October.
    11. Sigurd Dyrting & Abraham Flaxman & Ethan Sharygin, 2022. "Reconstruction of age distributions from differentially private census data," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 41(6), pages 2311-2329, December.
    12. Lun Liu & Elisabete A Silva & Ying Long, 2019. "Block-level changes in the socio-spatial landscape in Beijing: Trends and processes," Urban Studies, Urban Studies Journal Limited, vol. 56(6), pages 1198-1214, May.
    13. Neil Lee & Andrés Rodríguez-Pose, 2021. "Entrepreneurship and the fight against poverty in US cities," Environment and Planning A, , vol. 53(1), pages 31-52, February.
    14. Daniel J. Hicks, 2020. "Census Demographics and Chlorpyrifos Use in California’s Central Valley, 2011–15: A Distributional Environmental Justice Analysis," IJERPH, MDPI, vol. 17(7), pages 1-20, April.
    15. Michler, Jeffrey D. & Josephson, Anna & Kilic, Talip & Murray, Siobhan, 2022. "Privacy protection, measurement error, and the integration of remote sensing and socioeconomic survey data," Journal of Development Economics, Elsevier, vol. 158(C).
    16. Benavent-Corai, J. & Rojo, C. & Suárez-Torres, J. & Velasco-García, L., 2007. "Scaling properties in forest fire sequences: The human role in the order of nature," Ecological Modelling, Elsevier, vol. 205(3), pages 336-342.
    17. Levi J Wolf & Elijah Knaap & Sergio Rey, 2021. "Geosilhouettes: Geographical measures of cluster fit," Environment and Planning B, , vol. 48(3), pages 521-539, March.
    18. Feliu Serra-Burriel & Pedro Delicado & Fernando M. Cucchietti, 2021. "Wildfires Vegetation Recovery through Satellite Remote Sensing and Functional Data Analysis," Mathematics, MDPI, vol. 9(11), pages 1-22, June.
    19. Ghafar Salavati & Ebrahim Saniei & Ebrahim Ghaderpour & Quazi K. Hassan, 2022. "Wildfire Risk Forecasting Using Weights of Evidence and Statistical Index Models," Sustainability, MDPI, vol. 14(7), pages 1-15, March.
    20. Raoul S. Liévanos & Clare R. Evans & Ryan Light, 2021. "An Intercategorical Ecology of Lead Exposure: Complex Environmental Health Vulnerabilities in the Flint Water Crisis," IJERPH, MDPI, vol. 18(5), pages 1-23, February.

    More about this item

    Keywords

    network allocation; Master Address File; population representation;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cen:wpaper:20-05. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Dawn Anderson (email available below). General contact details of provider: https://edirc.repec.org/data/cesgvus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.