Printed from https://ideas.repec.org/p/cen/cpaper/2014-02.html

Estimating Record Linkage False Match Rate for the Person Identification Validation System

Author

Listed:
  • Mary Layne
  • Deborah Wagner
  • Cynthia Rothhaas

Abstract

The Census Bureau Person Identification Validation System (PVS) assigns unique person identifiers to federal, commercial, census, and survey data to facilitate linkages across files. PVS uses probabilistic matching to assign a unique Census Bureau identifier for each person. This paper presents a method to measure the false match rate in PVS following the approach of Belin and Rubin (1995). The Belin and Rubin methodology requires truth data to estimate a mixture model. The parameters from the mixture model are used to obtain point estimates of the false match rate for each of the PVS search modules. The truth data requirement is satisfied by the unique access the Census Bureau has to high-quality name, date of birth, address, and Social Security number (SSN) data. Truth data are quickly created for the Belin and Rubin model and do not involve a clerical review process. These truth data are used to create estimates for the Belin and Rubin parameters, making the approach more feasible. Both observed and modeled false match rates are computed for all search modules in federal administrative records data and commercial data.
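
To illustrate the distinction the abstract draws between observed and modeled false match rates, the following is a minimal sketch, not the PVS implementation or the authors' code. It assumes match scores come from a two-component mixture (false matches and true matches), fits plain normal components by EM, and reads the modeled false match rate above a cutoff from the fitted tails. The Belin and Rubin approach additionally transforms the scores and uses truth data to calibrate model parameters; both steps are omitted here. The simulated scores, the cutoff of 6.0, and all function names are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def normal_sf(x, mean, sd):
    """Upper-tail probability P(X > x) of a normal distribution."""
    return 0.5 * math.erfc((x - mean) / (sd * math.sqrt(2.0)))


def fit_mixture(scores, n_iter=200):
    """Fit a two-component normal mixture by EM.

    Returns (p_false, (mean_false, sd_false), (mean_true, sd_true)).
    The "false" component is initialised from the lower half of the scores.
    """
    ordered = sorted(scores)
    half = len(ordered) // 2
    mean_f = sum(ordered[:half]) / half
    mean_t = sum(ordered[half:]) / (len(ordered) - half)
    sd_f = sd_t = max((ordered[-1] - ordered[0]) / 4.0, 1e-6)
    p_false = 0.5
    for _ in range(n_iter):
        # E-step: posterior probability that each pair is a false match.
        resp = []
        for x in scores:
            a = p_false * normal_pdf(x, mean_f, sd_f)
            b = (1.0 - p_false) * normal_pdf(x, mean_t, sd_t)
            resp.append(a / (a + b) if a + b > 0.0 else 0.5)
        # M-step: weighted updates of the mixing proportion, means, and SDs.
        w_f = max(sum(resp), 1e-9)
        w_t = max(len(scores) - sum(resp), 1e-9)
        mean_f = sum(r * x for r, x in zip(resp, scores)) / w_f
        mean_t = sum((1.0 - r) * x for r, x in zip(resp, scores)) / w_t
        var_f = sum(r * (x - mean_f) ** 2 for r, x in zip(resp, scores)) / w_f
        var_t = sum((1.0 - r) * (x - mean_t) ** 2 for r, x in zip(resp, scores)) / w_t
        sd_f = max(math.sqrt(var_f), 1e-6)
        sd_t = max(math.sqrt(var_t), 1e-6)
        p_false = w_f / len(scores)
    return p_false, (mean_f, sd_f), (mean_t, sd_t)


def modeled_fmr(cutoff, p_false, false_params, true_params):
    """Model-based share of false matches among pairs scoring above cutoff."""
    f = p_false * normal_sf(cutoff, *false_params)
    t = (1.0 - p_false) * normal_sf(cutoff, *true_params)
    return f / (f + t) if f + t > 0.0 else 0.0


def observed_fmr(cutoff, scores, is_true_match):
    """Share of known false matches among pairs scoring above cutoff."""
    accepted = [m for s, m in zip(scores, is_true_match) if s > cutoff]
    if not accepted:
        return 0.0
    return sum(1 for m in accepted if not m) / len(accepted)


def simulate(n_true=2000, n_false=500, seed=0):
    """Hypothetical scores: true matches score high, false matches low."""
    rng = random.Random(seed)
    scores = [rng.gauss(10.0, 2.0) for _ in range(n_true)]
    scores += [rng.gauss(2.0, 2.0) for _ in range(n_false)]
    labels = [True] * n_true + [False] * n_false
    return scores, labels


scores, labels = simulate()
p_false, false_params, true_params = fit_mixture(scores)
print("modeled FMR:", modeled_fmr(6.0, p_false, false_params, true_params))
print("observed FMR:", observed_fmr(6.0, scores, labels))
```

In this sketch the truth labels are used only to compute the observed rate, so the two numbers can be compared. In the paper's setting, truth data also feed the estimation of the mixture model itself.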

Suggested Citation

  • Mary Layne & Deborah Wagner & Cynthia Rothhaas, 2014. "Estimating Record Linkage False Match Rate for the Person Identification Validation System," CARRA Working Papers 2014-02, Center for Economic Studies, U.S. Census Bureau.
  • Handle: RePEc:cen:cpaper:2014-02

    Download full text from publisher

    File URL: https://www.census.gov/content/dam/Census/library/working-papers/2014/adrm/carra-wp-2014-02.pdf
    File Function: First version, 2014
    Download Restriction: no

    References listed on IDEAS

    1. Deborah Wagner & Mary Layne, 2014. "The Person Identification Validation System (PVS): Applying the Center for Administrative Records Research and Applications’ (CARRA) Record Linkage Software," CARRA Working Papers 2014-01, Center for Economic Studies, U.S. Census Bureau.
    2. Larsen M. D & Rubin D. B, 2001. "Iterative Automated Record Linkage Using Mixture Models," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 32-41, March.

    Citations



    Cited by:

    1. Illenin Kondo & Kevin Rinz & Natalie Gubbay & Brandon Hawkins & John Voorheis & Abigail Wozniak, 2024. "Granular Income Inequality and Mobility Using IDDA: Exploring Patterns across Race and Ethnicity," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    2. John M. Abowd & Tamara Adams & Robert Ashmead & David Darais & Sourya Dey & Simson L. Garfinkel & Nathan Goldschlag & Daniel Kifer & Philip Leclerc & Ethan Lew & Scott Moore & Rolando A. Rodríguez & , 2023. "The 2010 Census Confidentiality Protections Failed, Here's How and Why," Papers 2312.11283, arXiv.org.
    3. J. David Brown & Misty L. Heggeness & Suzanne M. Dorinski & Lawrence Warren & Moises Yi, 2018. "Understanding the Quality of Alternative Citizenship Data Sources for the 2020 Census," Working Papers 18-38, Center for Economic Studies, U.S. Census Bureau.
    4. Raj Chetty & John N. Friedman & Nathaniel Hendren & Maggie R. Jones & Sonya R. Porter, 2018. "The Opportunity Atlas: Mapping the Childhood Roots of Social Mobility," Working Papers 18-42, Center for Economic Studies, U.S. Census Bureau.
    5. Carolyn A. Liebler & Renuka Bhaskar & Sonya Rastogi, 2014. "Dynamics of Race: Joining, Leaving, and Staying in the American Indian/Alaska Native Race Category between 2000 and 2010," CARRA Working Papers 2014-10, Center for Economic Studies, U.S. Census Bureau.
    6. Misty Heggeness & Marta Murray-Close, 2019. "Manning Up and Womaning Down: How Husbands and Wives Report Earnings When She Earns More," Opportunity and Inclusive Growth Institute Working Papers 28, Federal Reserve Bank of Minneapolis.
    7. Meyer, Bruce D. & Wyse, Angela & Corinth, Kevin, 2023. "The size and Census coverage of the U.S. homeless population," Journal of Urban Economics, Elsevier, vol. 136(C).
    8. J. David Brown & Misty L. Heggeness & Suzanne M. Dorinski & Lawrence Warren & Moises Yi, 2018. "Understanding the Quality of Alternative Citizenship Data Sources for the 2020 Census," Working Papers 18-38r, Center for Economic Studies, U.S. Census Bureau.
    9. John M. Abowd & Tamara Adams & Robert Ashmead & David Darais & Sourya Dey & Simson L. Garfinkel & Nathan Goldschlag & Daniel Kifer & Philip Leclerc & Ethan Lew & Scott Moore & Rolando A. Rodríguez & R, 2023. "The 2010 Census Confidentiality Protections Failed, Here's How and Why," Working Papers 23-63, Center for Economic Studies, U.S. Census Bureau.
    10. Carolyn A. Liebler & Sonya Rastogi & Leticia E. Fernandez & James M. Noon & Sharon R. Ennis, 2014. "America’s Churning Races: Race and Ethnic Response Changes between Census 2000 and the 2010 Census," CARRA Working Papers 2014-09, Center for Economic Studies, U.S. Census Bureau.
    11. J. David Brown & Misty L. Heggeness & Suzanne M. Dorinski & Lawrence Warren & Moises Yi, 2019. "Predicting the Effect of Adding a Citizenship Question to the 2020 Census," Demography, Springer;Population Association of America (PAA), vol. 56(4), pages 1173-1194, August.
    12. Randall Akee & Leah R. Clark, 2023. "Universal Preschool Lottery Admissions and Its Effects on Long-Run Earnings and Outcomes," Working Papers 23-09, Center for Economic Studies, U.S. Census Bureau.
    13. Christian Imboden & John Voorheis & Caroline Weber, 2023. "Self-Employment Income Reporting on Surveys," Working Papers 23-19, Center for Economic Studies, U.S. Census Bureau.
    14. Keller Andrew & Mule Vincent T. & Morris Darcy Steeg & Konicki Scott, 2018. "A Distance Metric for Modeling the Quality of Administrative Records for Use in the 2020 U.S. Census," Journal of Official Statistics, Sciendo, vol. 34(3), pages 599-624, September.
    15. Leticia Fernández & Sonya R. Porter & Sharon R. Ennis & Renuka Bhaskar, 2018. "Factors that Influence Change in Hispanic Identification: Evidence from Linked Decennial Census and American Community Survey Data," Working Papers 18-45, Center for Economic Studies, U.S. Census Bureau.
    16. Carolyn A. Liebler & Sonya R. Porter & Leticia E. Fernandez & James M. Noon & Sharon R. Ennis, 2017. "America’s Churning Races: Race and Ethnicity Response Changes Between Census 2000 and the 2010 Census," Demography, Springer;Population Association of America (PAA), vol. 54(1), pages 259-284, February.
    17. John M. Abowd & William R. Bell & J. David Brown & Michael B. Hawes & Misty L. Heggeness & Andrew D. Keller & Vincent T. Mule Jr. & Joseph L. Schafer & Matthew Spence & Lawrence Warren & Moises Yi, 2020. "Determination of the 2020 U.S. Citizen Voting Age Population (CVAP) Using Administrative Records and Statistical Methodology Technical Report," Working Papers 20-33, Center for Economic Studies, U.S. Census Bureau.
    18. Catherine G. Massey, 2014. "Creating Linked Historical Data: An Assessment of the Census Bureau’s Ability to Assign Protected Identification Keys to the 1960 Census," CARRA Working Papers 2014-12, Center for Economic Studies, U.S. Census Bureau.
    19. Mulry Mary H. & Keller Andrew D., 2017. "Comparison of 2010 Census Nonresponse Follow-Up Proxy Responses with Administrative Records Using Census Coverage Measurement Results," Journal of Official Statistics, Sciendo, vol. 33(2), pages 455-475, June.
    20. Bastian, Jacob E. & Jones, Maggie R., 2021. "Do EITC expansions pay for themselves? Effects on tax revenue and government transfers," Journal of Public Economics, Elsevier, vol. 196(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John Carter Braxton & Kyle F. Herkenhoff & Jonathan Rothbaum & Lawrence Schmidt, 2021. "Changing Income Risk across the US Skill Distribution: Evidence from a Generalized Kalman Filter," Opportunity and Inclusive Growth Institute Working Papers 55, Federal Reserve Bank of Minneapolis.
    2. Colmer, Jonathan & Lin, Dajun & Liu, Siying & Shimshack, Jay, 2021. "Why are pollution damages lower in developed countries? Insights from high-Income, high-particulate matter Hong Kong," Journal of Health Economics, Elsevier, vol. 79(C).
    3. Bruce D. Meyer & Derek Wu & Victoria R. Mooers & Carla Medalia, 2019. "The use and misuse of income data and extreme poverty in the United States," AEI Economics Working Papers 1018925, American Enterprise Institute.
    4. Josef Schürle, 2005. "A method for consideration of conditional dependencies in the Fellegi and Sunter model of record linkage," Statistical Papers, Springer, vol. 46(3), pages 433-449, July.
    5. Robert Collinson & John Eric Humphries & Nicholas Mader & Davin Reed & Daniel Tannenbaum & Winnie van Dijk, 2024. "Eviction and Poverty in American Cities," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 139(1), pages 57-120.
    6. Illenin Kondo & Kevin Rinz & Natalie Gubbay & Brandon Hawkins & John Voorheis & Abigail Wozniak, 2024. "Granular Income Inequality and Mobility Using IDDA: Exploring Patterns across Race and Ethnicity," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    7. Ufuk Akcigit & Nathan Goldschlag, 2022. "Measuring the Characteristics and Employment Dynamics of U.S. Inventors," Working Papers 22-43, Center for Economic Studies, U.S. Census Bureau.
    8. Nicholas Jones & Eric Jensen & Karen Battle & Rachel Marks, 2024. "Measuring the Racial and Ethnic Composition and Diversity of the United States Population: Historical Challenges and Contemporary Opportunities," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    9. Afshin Fallah & Mohsen Mohammadzadeh, 2010. "Bayesian regression analysis with linked data using mixture normal distributions," Statistical Papers, Springer, vol. 51(2), pages 421-430, June.
    10. Kevin Rinz, 2022. "Did Timing Matter? Life Cycle Differences in Effects of Exposure to the Great Recession," Journal of Labor Economics, University of Chicago Press, vol. 40(3), pages 703-735.
    11. Kevin L. McKinney & John M. Abowd, 2024. "Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics," Working Papers 24-48, Center for Economic Studies, U.S. Census Bureau.
    12. Matthew Cefalu & John Sullivan & Narayan Sastry & Elizabeth Fussell & Todd Gardner, 2024. "Gradient Boosting to Address Statistical Problems Arising from Non-Linkage of Census Bureau Datasets," Working Papers 24-27, Center for Economic Studies, U.S. Census Bureau.
    13. Thomas Stringham, 2022. "Fast Bayesian Record Linkage With Record-Specific Disagreement Parameters," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1509-1522, October.
    14. Kevin L. McKinney & John M. Abowd, 2024. "Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics," NBER Chapters, in: Race, Ethnicity, and Economic Statistics for the 21st Century, National Bureau of Economic Research, Inc.
    15. Jonathan M. Colmer & John L. Voorheis, 2024. "Microdata and the Valuation of Natural Capital," NBER Chapters, in: Measuring and Accounting for Environmental Public Goods: A National Accounts Perspective, National Bureau of Economic Research, Inc.
    16. Colmer, Jonathan & Voorheis, John, 2020. "The grandkids aren't alright: the intergenerational effects of prenatal pollution exposure," LSE Research Online Documents on Economics 108495, London School of Economics and Political Science, LSE Library.
    17. Catherine Buffington & Benjamin Cerf & Christina Jones & Bruce A. Weinberg, 2016. "STEM Training and Early Career Outcomes of Female and Male Graduate Students: Evidence from UMETRICS Data Linked to the 2010 Census," American Economic Review, American Economic Association, vol. 106(5), pages 333-338, May.
    18. Debabrata Dey, 2003. "Record Matching in Data Warehouses: A Decision Model for Data Consolidation," Operations Research, INFORMS, vol. 51(2), pages 240-254, April.
    19. Sarah Miller & Norman Johnson & Laura R Wherry, 2021. "Medicaid and Mortality: New Evidence From Linked Survey and Administrative Data," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 136(3), pages 1783-1829.
    20. Javier Miranda & Nikolas Zolas, 2017. "Measuring the Impact of Household Innovation Using Administrative Data," NBER Chapters, in: Measuring and Accounting for Innovation in the Twenty-First Century, pages 61-102, National Bureau of Economic Research, Inc.

    More about this item

    Keywords

    false match rate; PVS

