IDEAS home Printed from https://ideas.repec.org/p/baf/cbafwp/cbafwp25251.html

Text Analysis Methods for Historical Letters, The case of Michelangelo Buonarroti

Author

Listed:
  • Fabio Gatti
  • Joel Huesler

Abstract

The correspondence of historical personalities serves as a rich source of psychological, social, and economic information. Letters were indeed used as means of communication within the family circles but also a primary method for exchanging information with colleagues, subordinates, and employers. A quantitative analysis of such material enables scholars to reconstruct both the internal psychology and the relational networks of historical figures, ultimately providing deeper insights into the socio-economic systems in which they were embedded. In this study, we analyze the outgoing correspondence of Michelangelo Buonarroti, a prominent Renaissance artist, using a collection of 523 letters as the basis for a structured text analysis. Our methodological approach compares three distinct Natural Language Processing Methods: an Augmented Dictionary Approach, which relies on static lexicon analysis and Latent Dirichlet Allocation (LDA) for topic modeling, a Supervised Machine Learning Approach that utilizes BERT-generated letter embeddings combined with a Random Forest classifier trained by the authors, and an Unsupervised Machine Learning Method. The comparison of these three methods, benchmarked to biographic knowledge, allows us to construct a robust understanding of Michelangelo’s emotional association to monetary, thematic, and social factors. Furthermore, it highlights how the Supervised Machine Learning method, by incorporating the authors’ domain knowledge and understanding of documents and background, can provide, in the context of Renaissance multi-themed letters, a more nuanced interpretation of contextual meanings, enabling the detection of subtle (positive or negative) sentimental variations due to a variety of factors that other methods can overlook.

Suggested Citation

  • Fabio Gatti & Joel Huesler, 2025. "Text Analysis Methods for Historical Letters, The case of Michelangelo Buonarroti," BAFFI CAREFIN Working Papers 25251, BAFFI CAREFIN, Centre for Applied Research on International Markets Banking Finance and Regulation, Universita' Bocconi, Milano, Italy.
  • Handle: RePEc:baf:cbafwp:cbafwp25251
    as

    Download full text from publisher

    File URL: https://repec.unibocconi.it/baffic/baf/papers/cbafwp25251.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Robert Suits & Elisabeth Moyer, 2024. "Estimating energy flows in the long run: Agriculture in the United States, 1800–2020," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 57(4), pages 242-251, October.
    2. Philipp Koch & Viktor Stojkoski & César A. Hidalgo, 2024. "Augmenting the availability of historical GDP per capita estimates through machine learning," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 121(39), pages 2402060121-, September.
    3. Lino Wehrheim, 2019. "Economic history goes digital: topic modeling the Journal of Economic History," Cliometrica, Journal of Historical Economics and Econometric History, Association Française de Cliométrie (AFC), vol. 13(1), pages 83-125, January.
    4. Jawad Daheur & Julia Le Noë, 2024. "Socio-ecological metabolism and rural livelihood conditions: Two case studies on forest litter uses in France and Poland (1875–1910)," Historical Methods: A Journal of Quantitative and Interdisciplinary History, Taylor & Francis Journals, vol. 57(4), pages 205-225, October.
    5. Jules H. van Binsbergen & Svetlana Bryzgalova & Mayukh Mukhopadhyay & Varun Sharma, 2024. "(Almost) 200 Years of News-Based Economic Sentiment," NBER Working Papers 32026, National Bureau of Economic Research, Inc.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fabio Gatti & Joel Huesler, 2025. "Text Analysis Methods for Historical Letters, The case of Michelangelo Buonarrotti," Working Papers 0279, European Historical Economics Society (EHES).
    2. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    3. Beckmann, Joscha & Czudaj, Robert L. & Murach, Michael, 2024. "Macroeconomic effects from media coverage of the China–U.S. trade war on selected EU countries," European Journal of Political Economy, Elsevier, vol. 85(C).
    4. Claude Diebolt & Michael Haupert, 2021. "The Role of Cliometrics in History and Economics," Working Papers of BETA 2021-26, Bureau d'Economie Théorique et Appliquée, UDS, Strasbourg.
    5. Levy, Daniel & Mayer, Tamir & Raviv, Alon, 2022. "Economists in the 2008 financial crisis: Slow to see, fast to act," Journal of Financial Stability, Elsevier, vol. 60(C).
    6. Claude Diebolt & Michael Haupert, 2020. "How Cliometrics has Infiltrated Economics – and Helped to Improve the Discipline," Annals of the Fondazione Luigi Einaudi. An Interdisciplinary Journal of Economics, History and Political Science, Fondazione Luigi Einaudi, Torino (Italy), vol. 54(1), pages 219-230, June.
    7. Manish Jha & Jialin Qian & Michael Weber & Baozhong Yang, 2024. "Generative AI, Managerial Expectations, and Economic Activity," Papers 2410.03897, arXiv.org, revised Nov 2025.
    8. Peter Grajzl & Peter Murrell, 2021. "Characterizing a legal–intellectual culture: Bacon, Coke, and seventeenth-century England," Cliometrica, Journal of Historical Economics and Econometric History, Association Française de Cliométrie (AFC), vol. 15(1), pages 43-88, January.
    9. W. Benedikt Schmal, 2024. "Quantitative Tools for Time Series Analysis in Natural Language Processing: A Practitioners Guide," Papers 2404.18499, arXiv.org.
    10. Anselm Küsters, 2022. "Applying Lessons from the Past? Exploring Historical Analogies in ECB Speeches through Text Mining, 1997–2019," International Journal of Central Banking, International Journal of Central Banking, vol. 18(1), pages 277-329, March.
    11. David Lenz & Peter Winker, 2020. "Measuring the diffusion of innovations with paragraph vector topic models," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-18, January.
    12. Küsters Anselm & Andritzky Jochen, 2024. "Welche Rolle spielt das Thema Zukunft im Bundestag?," Wirtschaftsdienst, Sciendo, vol. 104(4), pages 252-257, April.
    13. Federico Pablo-Martí & Ángel Alañón-Pardo & Angel Sánchez, 2021. "Complex networks to understand the past: the case of roads in Bourbon Spain," Cliometrica, Journal of Historical Economics and Econometric History, Association Française de Cliométrie (AFC), vol. 15(3), pages 477-534, September.
    14. Nadia Fernández-de-Pinedo & Alvaro La Parra-Perez & Félix-Fernando Muñoz, 2023. "Correction to: Recent trends in publications of economic historians in Europe and North America (1980–2019): an empirical analysis," Cliometrica, Springer;Cliometric Society (Association Francaise de Cliométrie), vol. 17(1), pages 185-185, January.
    15. Wehrheim, Lino, 2021. "The sound of silence: On the (in)visibility of economists in the media," Working Papers 30, German Research Foundation's Priority Programme 1859 "Experience and Expectation. Historical Foundations of Economic Behaviour", Humboldt University Berlin.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:baf:cbafwp:cbafwp25251. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Michela Pozzi (email available below). General contact details of provider: https://edirc.repec.org/data/cbbocit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.