IDEAS home Printed from https://ideas.repec.org/p/cpr/ceprdp/15852.html

A Cross-verified Database of Notable People, 3500BC-2018AD

Author

Listed:
  • Wasmer, Etienne
  • Laouenan, Morgane
  • Bhargava, Palaash
  • Eymeoud, Jean Benoit
  • Plique, Guillaume

Abstract

We add to the literature on notable individuals (famous, prominent, distinguished) in collecting first a massive amount of data from various editions of Wikipedia and Wikidata along with deduplication techniques; and then using these partially overlapping sources to cross-verify each retrieved information. This strategy results in a cross-verified database of 2.2 million individuals, including a third who are not present in the English edition of Wikipedia. An extension to 4.7 million entries is currently not recommended given the inaccuracy of the information and discrepancies between Wikidata and other sources. A non-negligible fraction of newly-added individuals were collected from non-English editions of Wikipedia. We adopt a social science approach: data collection is driven by specific social questions on gender, economic and cul- tural development and quantitative exploration of cultural trends, that we document in this paper. A sample of 100,000 individuals is available here http://medialab.github.io/bhht-datascape, together with the most recent version of this paper.

Suggested Citation

  • Wasmer, Etienne & Laouenan, Morgane & Bhargava, Palaash & Eymeoud, Jean Benoit & Plique, Guillaume, 2021. "A Cross-verified Database of Notable People, 3500BC-2018AD," CEPR Discussion Papers 15852, C.E.P.R. Discussion Papers.
  • Handle: RePEc:cpr:ceprdp:15852
    as

    Download full text from publisher

    File URL: https://cepr.org/publications/DP15852
    Download Restriction: CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or

    for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Philippe Aghion & Nick Bloom & Richard Blundell & Rachel Griffith & Peter Howitt, 2005. "Competition and Innovation: an Inverted-U Relationship," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(2), pages 701-728.
    2. Reenu, 2008. "Role of Economic Reform in the Growth of Indian Economy," Journal of Commerce and Trade, Society for Advanced Management Studies, vol. 3(1), pages 19-22, April.
    3. Dave Donaldson & Richard Hornbeck, 2016. "Railroads and American Economic Growth: A "Market Access" Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 131(2), pages 799-858.
    4. David de la Croix & Omar Licandro, 2015. "The longevity of famous people from Hammurabi to Einstein," Journal of Economic Growth, Springer, vol. 20(3), pages 263-303, September.
    5. Robert B. Ekelund, Jr. & Robert F. Hebert & Robert D. Tollison, 2002. "An Economic Analysis of the Protestant Reformation," Journal of Political Economy, University of Chicago Press, vol. 110(3), pages 646-671, June.
    6. Kristian Behrens & Gilles Duranton & Frédéric Robert-Nicoud, 2014. "Productive Cities: Sorting, Selection, and Agglomeration," Journal of Political Economy, University of Chicago Press, vol. 122(3), pages 507-553.
    7. Gojko Barjamovic & Thomas Chaney & Kerem Cosar & Ali Hortacsu, 2019. "Trade, Merchants and the Lost Cities of the Bronze Age," SciencePo Working papers hal-03261799, HAL.
    8. Alex Bell & Raj Chetty & Xavier Jaravel & Neviana Petkova & John Van Reenen, 2019. "Who Becomes an Inventor in America? The Importance of Exposure to Innovation," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(2), pages 647-713.
    9. Alberto Alesina & Johann Harnoss & Hillel Rapoport, 2016. "Birthplace diversity and economic prosperity," Journal of Economic Growth, Springer, vol. 21(2), pages 101-138, June.
    10. Dao, Thu Hien & Docquier, Frédéric & Parsons, Chris & Peri, Giovanni, 2018. "Migration and development: Dissecting the anatomy of the mobility transition," Journal of Development Economics, Elsevier, vol. 132(C), pages 88-101.
    11. Claudia Goldin, 2014. "A Grand Gender Convergence: Its Last Chapter," American Economic Review, American Economic Association, vol. 104(4), pages 1091-1119, April.
    12. Roland G. Fryer & Steven D. Levitt, 2004. "The Causes and Consequences of Distinctively Black Names," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(3), pages 767-805.
    13. Gojko Barjamovic & Thomas Chaney & Kerem Coşar & Ali Hortaçsu, 2019. "Trade, Merchants, and the Lost Cities of the Bronze Age," The Quarterly Journal of Economics, Oxford University Press, vol. 134(3), pages 1455-1503.
    14. Edward L. Glaeser & Rafael La Porta & Florencio Lopez-de-Silanes & Andrei Shleifer, 2004. "Do Institutions Cause Growth?," Journal of Economic Growth, Springer, vol. 9(3), pages 271-303, September.
    15. Oded Galor, 2011. "Unified Growth Theory and Comparative Development," Rivista di Politica Economica, SIPI Spa, issue 2, pages 9-21, April-Jun.
    16. Becker, Sascha O. & Pfaff, Steven & Rubin, Jared, 2016. "Causes and consequences of the Protestant Reformation," Explorations in Economic History, Elsevier, vol. 62(C), pages 1-25.
    17. Allen,Robert C., 2009. "The British Industrial Revolution in Global Perspective," Cambridge Books, Cambridge University Press, number 9780521868273, January.
    18. C Jara-Figueroa & Amy Z Yu & César A Hidalgo, 2019. "How the medium shapes the message: Printing and the rise of the arts and sciences," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-14, February.
    19. Oded Galor & Omer Moav, 2004. "From Physical to Human Capital Accumulation: Inequality and the Process of Development," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 71(4), pages 1001-1026.
    20. Raj Chetty & Nathaniel Hendren & Patrick Kline & Emmanuel Saez, 2014. "Where is the land of Opportunity? The Geography of Intergenerational Mobility in the United States," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 129(4), pages 1553-1623.
    21. Daron Acemoglu & Simon Johnson, 2005. "Unbundling Institutions," Journal of Political Economy, University of Chicago Press, vol. 113(5), pages 949-995, October.
    22. Marianne Bertrand, 2020. "Gender in the Twenty-First Century," AEA Papers and Proceedings, American Economic Association, vol. 110, pages 1-24, May.
    23. Beine, Michel & Docquier, Frederic & Rapoport, Hillel, 2001. "Brain drain and economic growth: theory and evidence," Journal of Development Economics, Elsevier, vol. 64(1), pages 275-289, February.
    24. Crafts, Nicholas, 2011. "Explaining the first Industrial Revolution: two views," European Review of Economic History, Cambridge University Press, vol. 15(1), pages 153-168, April.
    25. La Porta, Rafael & Lopez-de-Silanes, Florencio & Shleifer, Andrei & Vishny, Robert, 1999. "The Quality of Government," The Journal of Law, Economics, and Organization, Oxford University Press, vol. 15(1), pages 222-279, April.
    26. Mark Aguiar & Erik Hurst, 2007. "Measuring Trends in Leisure: The Allocation of Time Over Five Decades," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(3), pages 969-1006.
    27. Bisin, Alberto & Verdier, Thierry, 2001. "The Economics of Cultural Transmission and the Dynamics of Preferences," Journal of Economic Theory, Elsevier, vol. 97(2), pages 298-319, April.
    28. Michel Serafinelli & Guido Tabellini, 2022. "Creativity over time and space," Journal of Economic Growth, Springer, vol. 27(1), pages 1-43, March.
    29. Nathaniel Baum-Snow, 2007. "Did Highways Cause Suburbanization?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 122(2), pages 775-805.
    30. Alberto Alesina & Paola Giuliano, 2015. "Culture and Institutions," Journal of Economic Literature, American Economic Association, vol. 53(4), pages 898-944, December.
    31. Gojko Barjamovic & Thomas Chaney & Kerem Coşar & Ali Hortaçsu, 2019. "Trade, Merchants, and the Lost Cities of the Bronze Age," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 134(3), pages 1455-1503.
    32. Michael Kremer, 1993. "Population Growth and Technological Change: One Million B.C. to 1990," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 108(3), pages 681-716.
    33. Amy Finkelstein & Matthew Gentzkow & Heidi Williams, 2021. "Place-Based Drivers of Mortality: Evidence from Migration," American Economic Review, American Economic Association, vol. 111(8), pages 2697-2735, August.
    34. repec:hal:pseose:hal-01304131 is not listed on IDEAS
    35. Hunt, Jennifer & Garant, Jean-Philippe & Herman, Hannah & Munroe, David J., 2013. "Why are women underrepresented amongst patentees?," Research Policy, Elsevier, vol. 42(4), pages 831-843.
    36. Paul J. J. Welfens, 2008. "ICT – productivity and economic growth in Europe," Springer Books, in: Paul J. J. Welfens & Ellen Walther-Klaus (ed.), Digital Excellence, pages 13-39, Springer.
    37. Davide Cantoni, 2015. "The Economic Effects Of The Protestant Reformation: Testing The Weber Hypothesis In The German Lands," Journal of the European Economic Association, European Economic Association, vol. 13(4), pages 561-598, August.
    38. Ran Abramitzky & Leah Boustan & Elisa Jacome & Santiago Perez, 2021. "Intergenerational Mobility of Immigrants in the United States over Two Centuries," American Economic Review, American Economic Association, vol. 111(2), pages 580-608, February.
    39. Oded Galor, 2011. "Unified Growth Theory," Economics Books, Princeton University Press, edition 1, number 9477.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. is not listed on IDEAS
    2. Borowiecki, Karol Jan & Kristensen, Martin Hørlyk & Law, Marc T., 2025. "Where are the female composers? Human capital and gender inequality in music history," European Economic Review, Elsevier, vol. 171(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bennett, Daniel L. & Faria, Hugo J. & Gwartney, James D. & Morales, Daniel R., 2017. "Economic Institutions and Comparative Economic Development: A Post-Colonial Perspective," World Development, Elsevier, vol. 96(C), pages 503-519.
    2. Hanlon, W.Walker & Heblich, Stephan, 2022. "History and urban economics," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    3. Johnson, Noel D. & Koyama, Mark, 2017. "Jewish communities and city growth in preindustrial Europe," Journal of Development Economics, Elsevier, vol. 127(C), pages 339-354.
    4. Quamrul Ashraf & Oded Galor, 2011. "Cultural Diversity, Geographical Isolation, and the Origin of the Wealth of Nations," Center for Development Economics 2011-10, Department of Economics, Williams College.
    5. Braunfels, Elias, 2016. "Further Unbundling Institutions," Discussion Paper Series in Economics 13/2016, Norwegian School of Economics, Department of Economics.
    6. Oded Galor, 2024. "Unified Growth Theory: Roots of Growth and Inequality in the Wealth of Nations," CESifo Working Paper Series 11571, CESifo.
    7. Combes, Pierre-Philippe & Gobillon, Laurent & Zylberberg, Yanos, 2022. "Urban economics in a historical perspective: Recovering data with machine learning," Regional Science and Urban Economics, Elsevier, vol. 94(C).
    8. Fiaschi, Davide & Fioroni, Tamara, 2019. "Transition to modern growth in Great Britain: The role of technological progress, adult mortality and factor accumulation," Structural Change and Economic Dynamics, Elsevier, vol. 51(C), pages 472-490.
    9. Oyèkọ́lá, Ọláyínká, 2021. "Where do people live longer?," Research in Economics, Elsevier, vol. 75(1), pages 21-44.
    10. Anastasia Litina, 2016. "Natural land productivity, cooperation and comparative development," Journal of Economic Growth, Springer, vol. 21(4), pages 351-408, December.
    11. Francesco Cinnirella & Jochen Streb, 2017. "The role of human capital and innovation in economic development: evidence from post-Malthusian Prussia," Journal of Economic Growth, Springer, vol. 22(2), pages 193-227, June.
    12. Canning, David & Mabeu, Marie Christelle & Pongou, Roland, 2020. "Colonial origins and fertility: can the market overcome history?," MPRA Paper 112496, University Library of Munich, Germany.
    13. Boikos, Spyridon & Bucci, Alberto & Stengos, Thanasis, 2022. "Leisure and innovation in horizontal R&D-based growth," Economic Modelling, Elsevier, vol. 107(C).
    14. Broadberry, Stephen & Ghosal, Sayantan & Proto, Eugenio, 2017. "Anonymity, efficiency wages and technological progress," Journal of Development Economics, Elsevier, vol. 127(C), pages 379-394.
    15. Felix S.F. Schaff, 2023. "The Unequal Spirit of the Protestant Reformation: Particularism and Wealth Distribution in Early Modern Germany," Working Papers 0239, European Historical Economics Society (EHES).
    16. Becker, Sascha O. & Rubin, Jared & Woessmann, Ludger, 2020. "Religion in Economic History : A Survey," The Warwick Economics Research Paper Series (TWERPS) 1273, University of Warwick, Department of Economics.
    17. Ko, Chiu Yu & Koyama, Mark & Sng, Tuan-Hwee, 2014. "Unified China; Divided Europe," MPRA Paper 60418, University Library of Munich, Germany.
    18. Jakob B. Madsen & Fabrice Murtin, 2017. "British economic growth since 1270: the role of education," Journal of Economic Growth, Springer, vol. 22(3), pages 229-272, September.
    19. Kemeny, Tom & Petralia, Sergio & Storper, Michael, 2022. "Disruptive innovation and spatial inequality," LSE Research Online Documents on Economics 115953, London School of Economics and Political Science, LSE Library.
    20. Adamson, Jordan, 2025. "Trade and the rise of ancient Greek city-states," Journal of Economic Behavior & Organization, Elsevier, vol. 235(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • N01 - Economic History - - General - - - Development of the Discipline: Historiographical; Sources and Methods
    • N9 - Economic History - - Regional and Urban History
    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cpr:ceprdp:15852. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://www.cepr.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.