IDEAS home Printed from https://ideas.repec.org/p/bdi/opques/qef_692_22.html
   My bibliography  Save this paper

Textual analysis of a Twitter corpus during the COVID-19 pandemics

Author

Listed:
  • Valerio Astuti

    (Bank of Italy)

  • Marta Crispino

    (Bank of Italy)

  • Marco Langiulli

    (Bank of Italy)

  • Juri Marcucci

    (Bank of Italy)

Abstract

Text data gathered from social media are extremely up-to-date and have a great potential value for economic research. At the same time, they pose some challenges, as they require different statistical methods from the ones used for traditional data. The aim of this paper is to give a critical overview of three of the most common techniques used to extract information from text data: topic modelling, word embedding and sentiment analysis. We apply these methodologies to data collected from Twitter during the COVID-19 pandemic to investigate the influence the pandemic had on the Italian Twitter community and to discover the topics most actively discussed on the platform. Using these techniques of automated textual analysis, we are able to make inferences about the most important subjects covered over time and build real-time daily indicators of the sentiment expressed on this platform.

Suggested Citation

  • Valerio Astuti & Marta Crispino & Marco Langiulli & Juri Marcucci, 2022. "Textual analysis of a Twitter corpus during the COVID-19 pandemics," Questioni di Economia e Finanza (Occasional Papers) 692, Bank of Italy, Economic Research and International Relations Area.
  • Handle: RePEc:bdi:opques:qef_692_22
    as

    Download full text from publisher

    File URL: https://www.bancaditalia.it/pubblicazioni/qef/2022-0692/QEF_692_22.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. Brodeur, Abel & Clark, Andrew E. & Fleche, Sarah & Powdthavee, Nattavudh, 2021. "COVID-19, lockdowns and well-being: Evidence from Google Trends," Journal of Public Economics, Elsevier, vol. 193(C).
    3. Altig, Dave & Baker, Scott & Barrero, Jose Maria & Bloom, Nicholas & Bunn, Philip & Chen, Scarlet & Davis, Steven J. & Leather, Julia & Meyer, Brent & Mihaylov, Emil & Mizen, Paul & Parker, Nicholas &, 2020. "Economic uncertainty before and during the COVID-19 pandemic," Journal of Public Economics, Elsevier, vol. 191(C).
    4. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    5. Thomas Renault, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-03205113, HAL.
    6. Margaret E. Roberts & Brandon M. Stewart & Edoardo M. Airoldi, 2016. "A Model of Text for Experimentation in the Social Sciences," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 988-1003, July.
    7. Nicholas Beauchamp, 2017. "Predicting and Interpolating State‐Level Polls Using Twitter Textual Data," American Journal of Political Science, John Wiley & Sons, vol. 61(2), pages 490-503, April.
    8. Angelico, Cristina & Marcucci, Juri & Miccoli, Marcello & Quarta, Filippo, 2022. "Can we measure inflation expectations using Twitter?," Journal of Econometrics, Elsevier, vol. 228(2), pages 259-277.
    9. Edoardo M. Airoldi & Jonathan M. Bischof, 2016. "Improving and Evaluating Topic Models and Other Models of Text," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1381-1403, October.
    10. Hino, Airo & Fahey, Robert A., 2019. "Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints," International Journal of Information Management, Elsevier, vol. 48(C), pages 175-184.
    11. Dave Altig & Scott Baker & Jose Maria Barrero & Nick Bloom & Philip Bunn & Scarlet Chen & Steven J Davis & Julia Leather & Brent Meyer & Emil Mihaylov & Paul Mizen & Nick Parker & Thomas Renault & Paw, 2020. "Economic uncertainty before and during the Covid-19 pandemic," Bank of England working papers 876, Bank of England.
    12. Ro'ee Levy, 2021. "Social Media, News Consumption, and Polarization: Evidence from a Field Experiment," American Economic Review, American Economic Association, vol. 111(3), pages 831-870, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. Sakariyahu, Rilwan & Lawal, Rodiat & Adigun, Rasheed & Paterson, Audrey & Johan, Sofia, 2024. "One crash, too many: Global uncertainty, sentiment factors and cryptocurrency market," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 94(C).
    3. Zeitun, Rami & Rehman, Mobeen Ur & Ahmad, Nasir & Vo, Xuan Vinh, 2023. "The impact of Twitter-based sentiment on US sectoral returns," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    4. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.
    5. J. Anthony Cookson & Corbin Fox & Javier Gil-Bazo & Juan Imbet & Christoph Schiller, 2024. "Social Media as a Bank Run Catalyst," Working Papers hal-04400382, HAL.
    6. Ito, Asei & Lim, Jaehwan & Zhang, Hongyong, 2023. "Catching the political leader's signal: Economic policy uncertainty and firm investment in China," China Economic Review, Elsevier, vol. 81(C).
    7. Andranik Tumasjan, 2024. "The many faces of social media in business and economics research: Taking stock of the literature and looking into the future," Journal of Economic Surveys, Wiley Blackwell, vol. 38(2), pages 389-426, April.
    8. Wichmann, Bruno & Wichmann, Roberta, 2022. "COVID-19 and Indigenous health in the Brazilian Amazon," Economic Modelling, Elsevier, vol. 115(C).
    9. Arin, K. Peren & Lacomba, Juan A. & Lagos, Francisco & Moro-Egido, Ana I. & Thum, Marcel, 2022. "Exploring the hidden impact of the Covid-19 pandemic: The role of urbanization," Economics & Human Biology, Elsevier, vol. 46(C).
    10. Heath,Rachel & Van Der Weide,Roy, 2024. "Gender, Social Support, and Political Speech : Evidence from Twitter," Policy Research Working Paper Series 10769, The World Bank.
    11. Chen, Jingjing & Chen, Wei & Liu, Ernest & Luo, Jie & Song, Zheng, 2025. "The economic cost of locking down like China: Evidence from city-to-city truck flows," Journal of Urban Economics, Elsevier, vol. 145(C).
    12. Wu, Jianxin & Zhan, Xiaoling & Xu, Hui & Ma, Chunbo, 2023. "The economic impacts of COVID-19 and city lockdown: Early evidence from China," Structural Change and Economic Dynamics, Elsevier, vol. 65(C), pages 151-165.
    13. Nguyen, Harvey & Pham, Anh Viet & Pham, Man Duy (Marty) & Pham, Mia Hang, 2023. "Business resilience: Lessons from government responses to the global COVID-19 crisis," International Business Review, Elsevier, vol. 32(5).
    14. Cookson, J. Anthony & Lu, Runjing & Mullins, William & Niessner, Marina, 2024. "The social signal," Journal of Financial Economics, Elsevier, vol. 158(C).
    15. Vacca, Matteo, 2024. "Panic herding: Analysts' COVID-19 experiences and the interpretation of earnings news," Journal of Economics and Business, Elsevier, vol. 132(C).
    16. Chen, Zhuo & Li, Pengfei & Liao, Li & Liu, Lu & Wang, Zhengwei, 2024. "Assessing and addressing the coronavirus-induced economic crisis: Evidence from 1.5 billion sales invoices," China Economic Review, Elsevier, vol. 85(C).
    17. Miescu, Mirela & Rossi, Raffaele, 2021. "COVID-19-induced shocks and uncertainty," European Economic Review, Elsevier, vol. 139(C).
    18. Capucine Riom & Anna Valero, 2020. "The business response to Covid-19: the CEP-CBI survey on technology adoption," CEP Covid-19 Analyses cepcovid-19-009, Centre for Economic Performance, LSE.
    19. Shen, Yiran & Liu, Chang & Sun, Xiaolei & Guo, Kun, 2023. "Investor sentiment and the Chinese new energy stock market: A risk–return perspective," International Review of Economics & Finance, Elsevier, vol. 84(C), pages 395-408.
    20. Siyi Liu & Xin Liu & Chuancai Zhang & Lingli Zhang, 2023. "Institutional and individual investors' short‐term reactions to the COVID‐19 crisis in China," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 63(4), pages 4333-4355, December.

    More about this item

    Keywords

    text as data; Twitter; big data; sentiment; Covid-19; topic analysis; word embedding;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • L82 - Industrial Organization - - Industry Studies: Services - - - Entertainment; Media

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bdi:opques:qef_692_22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/bdigvit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.