IDEAS home Printed from https://ideas.repec.org/p/bdi/opques/qef_692_22.html
   My bibliography  Save this paper

Textual analysis of a Twitter corpus during the COVID-19 pandemics

Author

Listed:
  • Valerio Astuti

    (Bank of Italy)

  • Marta Crispino

    (Bank of Italy)

  • Marco Langiulli

    (Bank of Italy)

  • Juri Marcucci

    (Bank of Italy)

Abstract

Text data gathered from social media are extremely up-to-date and have a great potential value for economic research. At the same time, they pose some challenges, as they require different statistical methods from the ones used for traditional data. The aim of this paper is to give a critical overview of three of the most common techniques used to extract information from text data: topic modelling, word embedding and sentiment analysis. We apply these methodologies to data collected from Twitter during the COVID-19 pandemic to investigate the influence the pandemic had on the Italian Twitter community and to discover the topics most actively discussed on the platform. Using these techniques of automated textual analysis, we are able to make inferences about the most important subjects covered over time and build real-time daily indicators of the sentiment expressed on this platform.

Suggested Citation

  • Valerio Astuti & Marta Crispino & Marco Langiulli & Juri Marcucci, 2022. "Textual analysis of a Twitter corpus during the COVID-19 pandemics," Questioni di Economia e Finanza (Occasional Papers) 692, Bank of Italy, Economic Research and International Relations Area.
  • Handle: RePEc:bdi:opques:qef_692_22
    as

    Download full text from publisher

    File URL: https://www.bancaditalia.it/pubblicazioni/qef/2022-0692/QEF_692_22.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. Brodeur, Abel & Clark, Andrew E. & Fleche, Sarah & Powdthavee, Nattavudh, 2021. "COVID-19, lockdowns and well-being: Evidence from Google Trends," Journal of Public Economics, Elsevier, vol. 193(C).
    3. Altig, Dave & Baker, Scott & Barrero, Jose Maria & Bloom, Nicholas & Bunn, Philip & Chen, Scarlet & Davis, Steven J. & Leather, Julia & Meyer, Brent & Mihaylov, Emil & Mizen, Paul & Parker, Nicholas &, 2020. "Economic uncertainty before and during the COVID-19 pandemic," Journal of Public Economics, Elsevier, vol. 191(C).
    4. Renault, Thomas, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Journal of Banking & Finance, Elsevier, vol. 84(C), pages 25-40.
    5. Thomas Renault, 2017. "Intraday online investor sentiment and return patterns in the U.S. stock market," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-03205113, HAL.
    6. Margaret E. Roberts & Brandon M. Stewart & Edoardo M. Airoldi, 2016. "A Model of Text for Experimentation in the Social Sciences," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 988-1003, July.
    7. Nicholas Beauchamp, 2017. "Predicting and Interpolating State‐Level Polls Using Twitter Textual Data," American Journal of Political Science, John Wiley & Sons, vol. 61(2), pages 490-503, April.
    8. Angelico, Cristina & Marcucci, Juri & Miccoli, Marcello & Quarta, Filippo, 2022. "Can we measure inflation expectations using Twitter?," Journal of Econometrics, Elsevier, vol. 228(2), pages 259-277.
    9. Edoardo M. Airoldi & Jonathan M. Bischof, 2016. "Improving and Evaluating Topic Models and Other Models of Text," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1381-1403, October.
    10. Hino, Airo & Fahey, Robert A., 2019. "Representing the Twittersphere: Archiving a representative sample of Twitter data under resource constraints," International Journal of Information Management, Elsevier, vol. 48(C), pages 175-184.
    11. Ro'ee Levy, 2021. "Social Media, News Consumption, and Polarization: Evidence from a Field Experiment," American Economic Review, American Economic Association, vol. 111(3), pages 831-870, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simon Porcher & Thomas Renault, 2021. "Social distancing beliefs and human mobility: Evidence from Twitter," PLOS ONE, Public Library of Science, vol. 16(3), pages 1-12, March.
    2. J. Anthony Cookson & Corbin Fox & Javier Gil-Bazo & Juan Imbet & Christoph Schiller, 2024. "Social Media as a Bank Run Catalyst," Working Papers hal-04400382, HAL.
    3. Sakariyahu, Rilwan & Lawal, Rodiat & Adigun, Rasheed & Paterson, Audrey & Johan, Sofia, 2024. "One crash, too many: Global uncertainty, sentiment factors and cryptocurrency market," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 94(C).
    4. Andranik Tumasjan, 2024. "The many faces of social media in business and economics research: Taking stock of the literature and looking into the future," Journal of Economic Surveys, Wiley Blackwell, vol. 38(2), pages 389-426, April.
    5. Cookson, J. Anthony & Lu, Runjing & Mullins, William & Niessner, Marina, 2024. "The social signal," Journal of Financial Economics, Elsevier, vol. 158(C).
    6. Zeitun, Rami & Rehman, Mobeen Ur & Ahmad, Nasir & Vo, Xuan Vinh, 2023. "The impact of Twitter-based sentiment on US sectoral returns," The North American Journal of Economics and Finance, Elsevier, vol. 64(C).
    7. Shen, Yiran & Liu, Chang & Sun, Xiaolei & Guo, Kun, 2023. "Investor sentiment and the Chinese new energy stock market: A risk–return perspective," International Review of Economics & Finance, Elsevier, vol. 84(C), pages 395-408.
    8. Thomas Renault, 2020. "Sentiment analysis and machine learning in finance: a comparison of methods and models on one million messages," Digital Finance, Springer, vol. 2(1), pages 1-13, September.
    9. Seok, Sangik & Cho, Hoon & Ryu, Doojin, 2022. "Scheduled macroeconomic news announcements and intraday market sentiment," The North American Journal of Economics and Finance, Elsevier, vol. 62(C).
    10. Smita Roy Trivedi, 2024. "Into the Unknown: Uncertainty, Foreboding and Financial Markets," Asia-Pacific Financial Markets, Springer;Japanese Association of Financial Economics and Engineering, vol. 31(1), pages 1-23, March.
    11. Camilla Salvatore & Silvia Biffignandi & Annamaria Bianchi, 2022. "Corporate Social Responsibility Activities Through Twitter: From Topic Model Analysis to Indexes Measuring Communication Characteristics," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 164(3), pages 1217-1248, December.
    12. Aprigliano, Valentina & Emiliozzi, Simone & Guaitoli, Gabriele & Luciani, Andrea & Marcucci, Juri & Monteforte, Libero, 2023. "The power of text-based indicators in forecasting Italian economic activity," International Journal of Forecasting, Elsevier, vol. 39(2), pages 791-808.
    13. Chi, Yeguang & El-Jahel, Lina & Vu, Thanh, 2024. "Novel and old news sentiment in commodity futures markets," Energy Economics, Elsevier, vol. 140(C).
    14. Dehler-Holland, Joris & Okoh, Marvin & Keles, Dogan, 2022. "Assessing technology legitimacy with topic models and sentiment analysis – The case of wind power in Germany," Technological Forecasting and Social Change, Elsevier, vol. 175(C).
    15. Song, Ziyu & Yu, Changrui, 2022. "Investor sentiment indices based on k-step PLS algorithm: A group of powerful predictors of stock market returns," International Review of Financial Analysis, Elsevier, vol. 83(C).
    16. Rui Fan & Oleksandr Talavera & Vu Tran, 2023. "Social media and price discovery: The case of cross‐listed firms," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 46(1), pages 151-167, February.
    17. Barbaglia, Luca & Bellia, Mario & Di Girolamo, Francesca & Rho, Caterina, 2024. "Crypto news and policy innovations: Are European markets affected?," JRC Working Papers in Economics and Finance 2024-07, Joint Research Centre, European Commission.
    18. Ito, Asei & Lim, Jaehwan & Zhang, Hongyong, 2023. "Catching the political leader's signal: Economic policy uncertainty and firm investment in China," China Economic Review, Elsevier, vol. 81(C).
    19. Marc-Aurèle Divernois & Damir Filipović, 2024. "StockTwits classified sentiment and stock returns," Digital Finance, Springer, vol. 6(2), pages 249-281, June.
    20. Fan, Rui & Talavera, Oleksandr & Tran, Vu, 2023. "Information flows and the law of one price," International Review of Financial Analysis, Elsevier, vol. 85(C).

    More about this item

    Keywords

    text as data; Twitter; big data; sentiment; Covid-19; topic analysis; word embedding;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • L82 - Industrial Organization - - Industry Studies: Services - - - Entertainment; Media

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bdi:opques:qef_692_22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/bdigvit.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.