IDEAS home Printed from https://ideas.repec.org/p/bdi/opques/qef_692_22.html
   My bibliography  Save this paper

Textual analysis of a Twitter corpus during the COVID-19 pandemics

Author

Listed:
  • Valerio Astuti

    (Bank of Italy)

  • Marta Crispino

    (Bank of Italy)

  • Marco Langiulli

    (Bank of Italy)

  • Juri Marcucci

    (Bank of Italy)

Abstract

Text data gathered from social media are extremely up-to-date and have a great potential value for economic research. At the same time, they pose some challenges, as they require different statistical methods from the ones used for traditional data. The aim of this paper is to give a critical overview of three of the most common techniques used to extract information from text data: topic modelling, word embedding and sentiment analysis. We apply these methodologies to data collected from Twitter during the COVID-19 pandemic to investigate the influence the pandemic had on the Italian Twitter community and to discover the topics most actively discussed on the platform. Using these techniques of automated textual analysis, we are able to make inferences about the most important subjects covered over time and build real-time daily indicators of the sentiment expressed on this platform.

Suggested Citation

  • Valerio Astuti & Marta Crispino & Marco Langiulli & Juri Marcucci, 2022. "Textual analysis of a Twitter corpus during the COVID-19 pandemics," Questioni di Economia e Finanza (Occasional Papers) 692, Bank of Italy, Economic Research and International Relations Area.
  • Handle: RePEc:bdi:opques:qef_692_22
    as

    Download full text from publisher

    File URL: https://www.bancaditalia.it/pubblicazioni/qef/2022-0692/QEF_692_22.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. Brodeur, Abel & Clark, Andrew E. & Fleche, Sarah & Powdthavee, Nattavudh, 2021. "COVID-19, lockdowns and well-being: Evidence from Google Trends," Journal of Public Economics, Elsevier, vol. 193(C).
    3. Altig, Dave & Baker, Scott & Barrero, Jose Maria & Bloom, Nicholas & Bunn, Philip & Chen, Scarlet & Davis, Steven J. & Leather, Julia & Meyer, Brent & Mihaylov, Emil & Mizen, Paul & Parker, Nicholas &, 2020. "Economic uncertainty before and during the COVID-19 pandemic," Journal of Public Economics, Elsevier, vol. 191(C).
    4. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    5. Thomas Renault, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-03205113, HAL.
    6. Margaret E. Roberts & Brandon M. Stewart & Edoardo M. Airoldi, 2016. "A Model of Text for Experimentation in the Social Sciences," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 988-1003, July.
    7. Nicholas Beauchamp, 2017. "Predicting and Interpolating State‐Level Polls Using Twitter Textual Data," American Journal of Political Science, John Wiley & Sons, vol. 61(2), pages 490-503, April.
    8. Angelico, Cristina & Marcucci, Juri & Miccoli, Marcello & Quarta, Filippo, 2022. "Can we measure inflation expectations using Twitter?," Journal of Econometrics, Elsevier, vol. 228(2), pages 259-277.
    9. Edoardo M. Airoldi & Jonathan M. Bischof, 2016. "Improving and Evaluating Topic Models and Other Models of Text," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1381-1403, October.
    10. Hino, Airo & Fahey, Robert A., 2019. "Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints," International Journal of Information Management, Elsevier, vol. 48(C), pages 175-184.
    11. Ro'ee Levy, 2021. "Social Media, News Consumption, and Polarization: Evidence from a Field Experiment," American Economic Review, American Economic Association, vol. 111(3), pages 831-870, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. Juan Imbet & J. Anthony Cookson & Corbin Fox & Christoph Schiller & Javier Gil-Bazo, 2024. "Social Media as a Bank Run Catalyst," Post-Print hal-04660083, HAL.
    3. Zeitun, Rami & Rehman, Mobeen Ur & Ahmad, Nasir & Vo, Xuan Vinh, 2023. "The impact of Twitter-based sentiment on US sectoral returns," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    4. Sandra Wankmüller, 2023. "A comparison of approaches for imbalanced classification problems in the context of retrieving relevant documents for an analysis," Journal of Computational Social Science, Springer, vol. 6(1), pages 91-163, April.
    5. Shen, Yiran & Liu, Chang & Sun, Xiaolei & Guo, Kun, 2023. "Investor sentiment and the Chinese new energy stock market: A risk–return perspective," International Review of Economics & Finance, Elsevier, vol. 84(C), pages 395-408.
    6. Zhang, Junsheng & Peng, Zezhi & Zeng, Yamin & Yang, Haisheng, 2023. "Do big data mutual funds outperform?," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 88(C).
    7. Dehler-Holland, Joris & Schumacher, Kira & Fichtner, Wolf, 2021. "Topic Modeling Uncovers Shifts in Media Framing of the German Renewable Energy Act," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 2(1).
    8. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    9. Gianluca Anese & Marco Corazza & Michele Costola & Loriana Pelizzon, 2023. "Impact of public news sentiment on stock market index return and volatility," Computational Management Science, Springer, vol. 20(1), pages 1-36, December.
    10. Béatrice BOULU-RESHEF & Catherine BRUNEAU & Maxime NICOLAS & Thomas RENAULT, 2022. "An Experimental Analysis of Investor Sentiment," LEO Working Papers / DR LEO 2940, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    11. Geng, Yuedan & Ye, Qiang & Jin, Yu & Shi, Wen, 2022. "Crowd wisdom and internet searches: What happens when investors search for stocks?," International Review of Financial Analysis, Elsevier, vol. 82(C).
    12. Seok, Sangik & Cho, Hoon & Ryu, Doojin, 2022. "Scheduled macroeconomic news announcements and intraday market sentiment," The North American Journal of Economics and Finance, Elsevier, vol. 62(C).
    13. Chen, Haozhi & Zhang, Yue, 2023. "Research on the effect of firm-specific investor sentiment on the idiosyncratic volatility anomaly: Evidence from the Chinese market," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    14. Abakah, Emmanuel Joel Aikins & Abdullah, Mohammad & Yousaf, Imran & Kumar Tiwari, Aviral & Li, Yanshuang, 2024. "Economic sanctions sentiment and global stock markets," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 91(C).
    15. Gaoshan Wang & Guangjin Yu & Xiaohong Shen, 2020. "The Effect of Online Investor Sentiment on Stock Movements: An LSTM Approach," Complexity, Hindawi, vol. 2020, pages 1-11, December.
    16. Wu, Jianxin & Zhan, Xiaoling & Xu, Hui & Ma, Chunbo, 2023. "The economic impacts of COVID-19 and city lockdown: Early evidence from China," Structural Change and Economic Dynamics, Elsevier, vol. 65(C), pages 151-165.
    17. Smita Roy Trivedi, 2024. "Into the Unknown: Uncertainty, Foreboding and Financial Markets," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 31(1), pages 1-23, March.
    18. Camilla Salvatore & Silvia Biffignandi & Annamaria Bianchi, 2022. "Corporate Social Responsibility Activities Through Twitter: From Topic Model Analysis to Indexes Measuring Communication Characteristics," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 164(3), pages 1217-1248, December.
    19. Fan, Rui & Talavera, Oleksandr & Tran, Vu, 2023. "Information flows and the law of one price," International Review of Financial Analysis, Elsevier, vol. 85(C).
    20. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.

    More about this item

    Keywords

    text as data; Twitter; big data; sentiment; Covid-19; topic analysis; word embedding;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • L82 - Industrial Organization - - Industry Studies: Services - - - Entertainment; Media

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bdi:opques:qef_692_22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/bdigvit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.