IDEAS home Printed from https://ideas.repec.org/p/cge/wacage/568.html

The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants

Author

Listed:
  • Anbinder, Tyler

    (George Washington University)

  • Connor, Dylan

    (Arizona State University)

  • O Grada, Cormac

    (University College, Dublin)

  • Wegge, Simone

    (College of Staten Island and The Graduate Center—CUNY)

Abstract

Automated census linkage algorithms have become popular for generating longitudinal data on social mobility, especially for immigrants and their children. But what if these algorithms are particularly bad at tracking immigrants? Using nineteenth-century Irish immigrants as a test case, we examine the most popular of these algorithms—that created by Abramitzky, Boustan, Eriksson (ABE), and their collaborators. Our findings raise serious questions about the quality of automated census links. False positives range from about one-third to one-half of all links depending on the ABE variant used. These bad links lead to sizeable estimation errors when measuring Irish immigrant social mobility.

Suggested Citation

  • Anbinder, Tyler & Connor, Dylan & O Grada, Cormac & Wegge, Simone, 2021. "The Problem of False Positives in Automated Census Linking: Evidence from Nineteenth-Century New York's Irish Immigrants," CAGE Online Working Paper Series 568, Competitive Advantage in the Global Economy (CAGE).
  • Handle: RePEc:cge:wacage:568
    as

    Download full text from publisher

    File URL: https://warwick.ac.uk/fac/soc/economics/research/centres/cage/manage/publications/wp568.2021.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Catherine G. Massey, 2017. "Playing with matches: An assessment of accuracy in linked historical data," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 129-143, July.
    2. Kosack, Edward & Ward, Zachary, 2020. "El Sueño Americano? The Generational Progress of Mexican Americans Prior to World War II," The Journal of Economic History, Cambridge University Press, vol. 80(4), pages 961-995, December.
    3. Herscovici, Steven, 1998. "Migration and Economic Mobility: Wealth Accumulation and Occupational Change Among Antebellum Migrants and Persisters," The Journal of Economic History, Cambridge University Press, vol. 58(4), pages 927-956, December.
    4. Herscovici, Steven, 1997. "Progress Amid Poverty: Economic Opportunity in Antebellum Newburyport," The Journal of Economic History, Cambridge University Press, vol. 57(2), pages 484-488, June.
    5. Ran Abramitzky & Leah Platt Boustan & Katherine Eriksson, 2012. "Europe's Tired, Poor, Huddled Masses: Self-Selection and Economic Outcomes in the Age of Mass Migration," American Economic Review, American Economic Association, vol. 102(5), pages 1832-1856, August.
    6. Cormac Ó Gráda, 2005. "The New York Irish in the 1850s : locked in by poverty?," Open Access publications 10197/489, School of Economics, University College Dublin.
    7. Eugene N. White & Cormac Ó Gráda, 2003. "The panics of 1854 and 1857 : a view from the Emigration Industrial Savings Bank," Open Access publications 10197/438, School of Economics, University College Dublin.
    8. Marco Breschi & Lucia Pozzi & Stanislao Mazzoni & Massimo Esposito, 2014. "Fertility transition and social stratification in the town of Alghero, Sardinia (1866-1935)," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 30(28), pages 823-852.
    9. Simone A. Wegge & Tyler Anbinder & Cormac Ó Gráda, 2017. "Immigrants and savers: A rich new database on the Irish in 1850s New York," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 144-155, July.
    10. Ó Gráda, Cormac & White, Eugene N., 2003. "The Panics of 1854 and 1857: A View from the Emigrant Industrial Savings Bank," The Journal of Economic History, Cambridge University Press, vol. 63(1), pages 213-240, March.
    11. Dorothee Crayen & Joerg Baten, 2010. "New evidence and new methods to measure human capital inequality before and during the industrial revolution: France and the US in the seventeenth to nineteenth centuries," Economic History Review, Economic History Society, vol. 63(2), pages 452-478, May.
    12. Philipp Ager & Leah Boustan & Katherine Eriksson, 2021. "The Intergenerational Effects of a Large Wealth Shock: White Southerners after the Civil War," American Economic Review, American Economic Association, vol. 111(11), pages 3767-3794, November.
    13. Zachary Ward, 2023. "Intergenerational Mobility in American History: Accounting for Race and Measurement Error," American Economic Review, American Economic Association, vol. 113(12), pages 3213-3248, December.
    14. Matthias Blum & Christopher L. Colvin & Laura McAtackney & Eoin McLaughlin, 2017. "Women of an uncertain age: quantifying human capital accumulation in rural Ireland in the nineteenth century," Economic History Review, Economic History Society, vol. 70(1), pages 187-223, February.
    15. Martin Dribe & J. David Hacker & Francesco Scalone, 2014. "The impact of socio-economic status on net fertility during the historical fertility decline: A comparative analysis of Canada, Iceland, Sweden, Norway, and the USA," Population Studies, Taylor & Francis Journals, vol. 68(2), pages 135-149, July.
    16. Chris Vickers & Nicolas L. Ziebarth, 2016. "Economic Development and the Demographics of Criminals in Victorian England," Journal of Law and Economics, University of Chicago Press, vol. 59(1), pages 191-223.
    17. A'Hearn, Brian & Baten, Jörg & Crayen, Dorothee, 2009. "Quantifying Quantitative Literacy: Age Heaping and the History of Human Capital," The Journal of Economic History, Cambridge University Press, vol. 69(3), pages 783-808, September.
    18. Cirenza, Peter, 2015. "Geography and assimilation: a case study of Irish immigrants in late nineteenth century America," Economic History Working Papers 60964, London School of Economics and Political Science, Department of Economic History.
    19. Jason Long & Joseph Ferrie, 2018. "Grandfathers Matter(ed): Occupational Mobility Across Three Generations in the US and Britain, 1850–1911," Economic Journal, Royal Economic Society, vol. 128(612), pages 422-445, July.
    20. Mokyr, Joel & Grada, Cormac O, 1982. "Emigration and poverty in prefamine Ireland," Explorations in Economic History, Elsevier, vol. 19(4), pages 360-384, October.
    21. Cormac O Grada & Morgan Kelly, 2000. "Market Contagion: Evidence from the Panics of 1854 and 1857," American Economic Review, American Economic Association, vol. 90(5), pages 1110-1124, December.
    22. Ran Abramitzky & Roy Mill & Santiago Pérez, 2020. "Linking individuals across historical sources: A fully automated approach," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 53(2), pages 94-111, April.
    23. Connor, Dylan Shane, 2019. "The Cream of the Crop? Geography, Networks, and Irish Migrant Selection in the Age of Mass Migration," The Journal of Economic History, Cambridge University Press, vol. 79(1), pages 139-175, March.
    24. Alter, George & Goldin, Claudia & Rotella, Elyce, 1994. "The Savings of Ordinary Americans: The Philadelphia Saving Fund Society in the Mid-Nineteenth Century," The Journal of Economic History, Cambridge University Press, vol. 54(4), pages 735-767, December.
    25. Aaronson, Daniel & Davis, Jonathan & Schulze, Karl, 2020. "Internal immigrant mobility in the early 20th century: evidence from Galveston, Texas," Explorations in Economic History, Elsevier, vol. 76(C).
    26. Price, Joseph & Buckles, Kasey & Van Leeuwen, Jacob & Riley, Isaac, 2021. "Combining family history and machine learning to link historical records: The Census Tree data set," Explorations in Economic History, Elsevier, vol. 80(C).
    27. Martha J. Bailey & Connor Cole & Morgan Henderson & Catherine Massey, 2020. "How Well Do Automated Linking Methods Perform? Lessons from US Historical Data," Journal of Economic Literature, American Economic Association, vol. 58(4), pages 997-1044, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zimran, Ariell, 2022. "US immigrants’ secondary migration and geographic assimilation during the Age of Mass Migration," Explorations in Economic History, Elsevier, vol. 85(C).
    2. Zhu, Ziming, 2022. "Like father like son? Intergenerational immobility in England, 1851-1911," LSE Research Online Documents on Economics 117588, London School of Economics and Political Science, LSE Library.
    3. Dora L. Costa & Coralee Lewis & Noelle Yetter, 2023. "Children and grandchildren of Union Army veterans: New data collections to study the persistence of longevity and socioeconomic status across generations," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 56(4), pages 223-239, October.
    4. Zhu, Ziming, 2022. "Like father like son? Intergenerational immobility in England, 1851-1911," Economic History Working Papers 117588, London School of Economics and Political Science, Department of Economic History.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eric S. M. Protzer & Sultan Orazbayev & Andres Gomez-Lievano & Matte Hartog & Frank Neffke, 2024. "A New Algorithm to Efficiently Match U.S. Census Records and Balance Representativity with Match Quality," Growth Lab Working Papers 238, Harvard's Growth Lab.
    2. Dahl, Christian M. & Johansen, Torben S.D. & Sørensen, Emil N. & Wittrock, Simon, 2023. "HANA: A handwritten name database for offline handwritten text recognition," Explorations in Economic History, Elsevier, vol. 87(C).
    3. Hwang, Sam Il Myoung & Squires, Munir, 2024. "Linked samples and measurement error in historical US census data," Explorations in Economic History, Elsevier, vol. 93(C).
    4. Simone A. Wegge & Tyler Anbinder & Cormac Ó Gráda, 2017. "Immigrants and savers: A rich new database on the Irish in 1850s New York," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 50(3), pages 144-155, July.
    5. Lehmann-Hasemeyer, Sibylle & Neumayer, Andreas & Streb, Jochen, 2023. "Heterogeneous inflation and deflation experiences and savings decisions during German industrialization," Journal of Banking & Finance, Elsevier, vol. 154(C).
    6. Collins, William J. & Zimran, Ariell, 2019. "The economic assimilation of Irish Famine migrants to the United States," Explorations in Economic History, Elsevier, vol. 74(C).
    7. Toni Ricardo Eugenio dos Santos & Marcio Issao Nakane, 2021. "Dynamic bank runs: an agent-based approach," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 16(3), pages 675-703, July.
    8. Bennett, Robert J. & Montebruno, Piero & Van Lieshout, Carry & Smith, Harry, 2022. "Business entry and exit: career changes of proprietors in England and Wales (1851-81) using record-linkage," LSE Research Online Documents on Economics 113867, London School of Economics and Political Science, LSE Library.
    9. Lehmann-Hasemeyer, Sibylle H. & Neumayer, Andreas & Streb, Jochen, 2022. "Heterogeneous savers and their inflation expectation during German industrialization: Social class, wealth, and gender," Working Papers 33, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.
    10. Nobel Prize Committee, 2022. "Financial Intermediation and the Economy," Nobel Prize in Economics documents 2022-2, Nobel Prize Committee.
    11. Zhiguo He & Asaf Manela, 2016. "Information Acquisition in Rumor‐Based Bank Runs," Journal of Finance, American Finance Association, vol. 71(3), pages 1113-1158, June.
    12. Elisa Jácome & Ilyana Kuziemko & Suresh Naidu, 2021. "Mobility for All: Representative Intergenerational Mobility Estimates over the 20th Century," Working Papers 302, Princeton University, Department of Economics, Center for Economic Policy Studies..
    13. Hanna M. Schwank, 2024. "Disruptive Effects of Natural Disasters: The 1906 San Francisco Fire," ECONtribute Discussion Papers Series 312, University of Bonn and University of Cologne, Germany.
    14. Ran Abramitzky & Roy Mill & Santiago Pérez, 2020. "Linking individuals across historical sources: A fully automated approach," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 53(2), pages 94-111, April.
    15. Martha J. Bailey & Peter Z. Lin, 2024. "Marital Matching and Women’s Intergenerational Mobility in the Late 19th and Early 20th Century US," NBER Chapters, in: The Economic History of American Inequality: New Evidence and Perspectives, pages 165-196, National Bureau of Economic Research, Inc.
    16. Hubert J. Kiss & Ismael Rodriguez-Lara & Alfonso Rosa-Garcia, 2022. "Experimental bank runs," Chapters, in: Sascha Füllbrunn & Ernan Haruvy (ed.), Handbook of Experimental Finance, chapter 25, pages 347-361, Edward Elgar Publishing.
      • Hubert J. Kiss & Ismael Rodriguez-Lara & Alfonso Rosa-Garcia, 2021. "Experimental Bank Runs," ThE Papers 21/03, Department of Economic Theory and Economic History of the University of Granada..
    17. Kristian Blickle & Markus Brunnermeier & Stephan Luck, 2020. "Micro-evidence from a System-wide Financial Meltdown: The German Crisis of 1931," Working Papers 275, Princeton University, Department of Economics, Center for Economic Policy Studies..
    18. Gary Gorton, 2008. "The Subprime Panic," Yale School of Management Working Papers amz2504, Yale School of Management.
    19. Beltrán Tapia, Francisco J. & Díez-Minguela, Alfonso & Martinez-Galarraga, Julio & Tirado-Fabregat, Daniel A., 2022. "Two Stories, One Fate: Age-Heaping And Literacy In Spain, 1877-1930," Revista de Historia Económica / Journal of Iberian and Latin American Economic History, Cambridge University Press, vol. 40(3), pages 405-438, December.
    20. Howard Bodenhorn, 2017. "Finance and Growth: Household Savings, Public Investment, and Public Health in Late Nineteenth-Century New Jersey," NBER Working Papers 23430, National Bureau of Economic Research, Inc.

    More about this item

    JEL classification:

    • N21 - Economic History - - Financial Markets and Institutions - - - U.S.; Canada: Pre-1913
    • J61 - Labor and Demographic Economics - - Mobility, Unemployment, Vacancies, and Immigrant Workers - - - Geographic Labor Mobility; Immigrant Workers
    • R23 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - Household Analysis - - - Regional Migration; Regional Labor Markets; Population

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cge:wacage:568. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Jane Snape (email available below). General contact details of provider: https://edirc.repec.org/data/dewaruk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.