Printed from https://ideas.repec.org/p/arx/papers/2505.09399.html

Augmenting the availability of historical GDP per capita estimates through machine learning

Author

Listed:
  • Philipp Koch
  • Viktor Stojkoski
  • César A. Hidalgo

Abstract

Can we use data on the biographies of historical figures to estimate the GDP per capita of countries and regions? Here we introduce a machine learning method to estimate the GDP per capita of dozens of countries and hundreds of regions in Europe and North America for the past 700 years, starting from data on the places of birth, death, and occupations of hundreds of thousands of historical figures. We build an elastic net regression model to perform feature selection and generate out-of-sample estimates that explain 90% of the variance in known historical income levels. We use this model to generate GDP per capita estimates for countries, regions, and time periods for which such data are not available, and we externally validate our estimates by comparing them with four proxies of economic output: urbanization rates in the past 500 years, body height in the 18th century, wellbeing in 1850, and church building activity in the 14th and 15th centuries. Additionally, we show that our estimates reproduce the well-known reversal of fortune between southwestern and northwestern Europe between 1300 and 1800 and find that this is largely driven by countries and regions engaged in Atlantic trade. These findings validate the use of fine-grained biographical data as a method to produce historical GDP per capita estimates. We publish our estimates with confidence intervals together with all collected source data in a comprehensive dataset.
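
The abstract names elastic net regression as the tool for feature selection and out-of-sample estimation. The following is a minimal illustrative sketch of that general technique, not the authors' code: the data are synthetic, the feature count, the penalty settings (alpha, l1_ratio), and all function names are assumptions chosen for illustration, and the objective follows the common convention (1/2n)*||y - Xw||^2 + alpha*l1_ratio*||w||_1 + 0.5*alpha*(1 - l1_ratio)*||w||^2. It uses only the Python standard library.

```python
import random

def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the L1 part of the update)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0

def fit_elastic_net(X, y, alpha=0.1, l1_ratio=0.5, n_iter=200):
    """Coordinate descent for the elastic net objective.
    No intercept is fitted: inputs are assumed to be roughly centered."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # equals y - Xw while w is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            # Covariance of feature j with the residual that excludes feature j
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_wj = soft_threshold(rho, alpha * l1_ratio) / (col_sq[j] + alpha * (1 - l1_ratio))
            delta = new_wj - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_wj
    return w

def predict(X, w):
    return [sum(xi * wi for xi, wi in zip(row, w)) for row in X]

def r_squared(y_true, y_pred):
    """Share of variance in y_true explained by y_pred."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((a - b) ** 2 for a, b in zip(y_true, y_pred))
    ss_tot = sum((a - mean_y) ** 2 for a in y_true)
    return 1.0 - ss_res / ss_tot

# Synthetic stand-in data (hypothetical): 10 features, only the first 3 matter.
rng = random.Random(0)
n_samples, n_features = 200, 10
true_w = [2.0, -1.5, 1.0] + [0.0] * 7
X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_samples)]
y = [sum(x * w for x, w in zip(row, true_w)) + rng.gauss(0, 0.5) for row in X]

# Hold out the last 50 rows to measure out-of-sample fit.
X_train, y_train = X[:150], y[:150]
X_test, y_test = X[150:], y[150:]

w_hat = fit_elastic_net(X_train, y_train, alpha=0.1, l1_ratio=0.9)
selected = [j for j, wj in enumerate(w_hat) if wj != 0.0]  # features kept by the L1 penalty
test_r2 = r_squared(y_test, predict(X_test, w_hat))
print("selected features:", selected)
print("out-of-sample R^2:", round(test_r2, 3))
```

The L1 term sets the coefficients of weak features exactly to zero, which is what makes the model usable for feature selection, while the held-out R^2 plays the role of the out-of-sample variance explained that the abstract reports. In practice the penalty strength would be tuned, for example by cross-validation, rather than fixed as it is here.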

Suggested Citation

  • Philipp Koch & Viktor Stojkoski & César A. Hidalgo, 2025. "Augmenting the availability of historical GDP per capita estimates through machine learning," Papers 2505.09399, arXiv.org.
  • Handle: RePEc:arx:papers:2505.09399

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2505.09399
    File Function: Latest version
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hidalgo, César A., 2023. "The policy implications of economic complexity," Research Policy, Elsevier, vol. 52(9).
    2. Viktor Stojkoski & Philipp Koch & Eva Coll & César A. Hidalgo, 2024. "Estimating digital product trade through corporate revenue data," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    3. Deseau, Arnaud, 2024. "Speed of convergence in a Malthusian world: Weak or strong homeostasis?," Explorations in Economic History, Elsevier, vol. 94(C).
    4. Stojkoski, Viktor & Hidalgo, César, 2025. "Optimizing Economic Complexity," TSE Working Papers 24-1623, Toulouse School of Economics (TSE).
    5. Arnaud Deseau, 2023. "Speed of Convergence in a Malthusian World: Weak or Strong Homeostasis?," AMSE Working Papers 2326, Aix-Marseille School of Economics, France.
    6. Alexander Donges & Jean-Marie Meier & Rui C. Silva, 2023. "The Impact of Institutions on Innovation," Management Science, INFORMS, vol. 69(4), pages 1951-1974, April.
    7. Palma, Nuno & Reis, Jaime & Rodrigues, Lisbeth, 2023. "Historical gender discrimination does not explain comparative Western European development: evidence from Portugal, 1300-1900," Explorations in Economic History, Elsevier, vol. 88(C).
    8. Lecce, Giampaolo & Ogliari, Laura, 2019. "Institutional Transplant and Cultural Proximity: Evidence from Nineteenth-Century Prussia," The Journal of Economic History, Cambridge University Press, vol. 79(4), pages 1060-1093, December.
    9. Bernardo Caldarola & Dario Mazzilli & Lorenzo Napolitano & Aurelio Patelli & Angelica Sbardella, 2023. "Economic complexity and the sustainability transition: A review of data, methods, and literature," Papers 2308.07172, arXiv.org, revised Mar 2024.
    10. César A. Hidalgo, 2022. "The Policy Implications of Economic Complexity," Papers 2205.02164, arXiv.org, revised Aug 2023.
    11. Broadberry, Stephen & Lennard, Jason, 2024. "European business cycles and economic growth, 1300–2000," Explorations in Economic History, Elsevier, vol. 94(C).
    12. Francesco Cinnirella & Jochen Streb, 2017. "Religious Tolerance as Engine of Innovation," CESifo Working Paper Series 6797, CESifo.
    13. Binzel, Christine & Link, Andreas & Ramachandran, Rajesh, 2021. "Language, Knowledge, and Growth: Evidence from Early Modern Europe," CEPR Discussion Papers 15454, C.E.P.R. Discussion Papers.
    14. Cesar A. Hidalgo, 2022. "Knowledge is non-fungible," Papers in Evolutionary Economic Geography (PEEG) 2229, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Nov 2022.
    15. Alexandra M. de Pleijt & Jan Luiten van Zanden, 2016. "Accounting for the “Little Divergence”: What drove economic growth in pre-industrial Europe, 1300–1800?," European Review of Economic History, European Historical Economics Society, vol. 20(4), pages 387-409.
    16. Davide Cantoni & Noam Yuchtman, 2020. "Historical Natural Experiments: Bridging Economics and Economic History," NBER Working Papers 26754, National Bureau of Economic Research, Inc.
    17. Mara P. Squicciarini & Nico Voigtländer, 2015. "Human Capital and Industrialization: Evidence from the Age of Enlightenment," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 130(4), pages 1825-1883.
    18. Uribe, Jorge M., 2025. "Investment in intangible assets and economic complexity," Research Policy, Elsevier, vol. 54(1).
    19. Alfani, Guido & Gierok, Victoria & Schaff, Felix, 2025. "Poverty in Germany from the Black Death until the Beginning of Industrialization," Explorations in Economic History, Elsevier, vol. 95(C).
    20. Greif, Gavin, 2022. "Merchants, proto-firms, and the German industrialization: the commercial determinants of nineteenth century town growth," LSE Research Online Documents on Economics 113346, London School of Economics and Political Science, LSE Library.
