IDEAS home Printed from https://ideas.repec.org/a/spr/ijsaem/v15y2024i5d10.1007_s13198-023-02082-0.html
   My bibliography  Save this article

Sentiment analysis and topic modeling of COVID-19 tweets of India

Author

Listed:
  • Manju Bhardwaj

    (Maitreyi College)

  • Priya Mishra

    (Maitreyi College)

  • Shikha Badhani

    (Maitreyi College)

  • Sunil K. Muttoo

    (University of Delhi)

Abstract

Social media platforms provide an opportunity to the users to express their views and emotions on any topic. Various researchers have successfully used the content posted on these platforms to capture the emotions of the people about the given event or topic. During COVID-19 pandemic, Indians extensively used Twitter owing to an increased need for virtual interaction. In this work, we analyse the tweets posted in India during COVID-19 outbreak to understand how individuals in India reacted to the pandemic. We identified the timelines of three major COVID-19 waves from May 2020 to March 2022 and retrieved 13,818 tweets from COV19Tweets dataset available at IEEE DataPort for the respective duration of each of the three waves. Lexicon based sentiment analysis of the tweets indicated a positive mindset of the Indian population during the pandemic. Further, visual analysis through word clouds revealed that a few words were common for all waves whereas some words were wave-specific. It was observed that the words used in tweets cannot be compulsorily associated with positive or negative emotions, as the context or the set of words taken together may be a better indicator. Hence, machine learning approach was followed for the identification of sentiments by extracting BoW (Bag-of-Words) and TF–IDF (Term Frequency–Inverse Document Frequency) features from the tweet text. Comparative performance analysis of the four classification algorithms, namely, Decision Tree (DT), Logistic Regression (LR), Naive Bayes (NB), and Support Vector Machines (SVM) and two ensemble methods Adaboost and Random Forest revealed that LR applied to BoW featureset was the best performer. Finally, we performed Latent Dirichlet Allocation (LDA) based topic modeling on the COVID-19 tweets to identify topics of discussion in each of the waves. The topics evolved from informative messages related to the pandemic during the first wave, to wider discussions related to the impact of COVID-19 on nifty, tourism, etc. for the second wave, and the omicron virus, availability of beds, and ventilators in the third wave. This study can be of great interest to governments, as they may undertake similar studies to understand human behavior when natural calamities or pandemics occur at the local or global levels. The automated capture of public sentiments and identification of topics may expedite the appropriate execution of preventive measures taken by governments and address the concerns of citizens almost instantly.

Suggested Citation

  • Manju Bhardwaj & Priya Mishra & Shikha Badhani & Sunil K. Muttoo, 2024. "Sentiment analysis and topic modeling of COVID-19 tweets of India," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 15(5), pages 1756-1776, May.
  • Handle: RePEc:spr:ijsaem:v:15:y:2024:i:5:d:10.1007_s13198-023-02082-0
    DOI: 10.1007/s13198-023-02082-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13198-023-02082-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13198-023-02082-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lu, Qinli & Chesbrough, Henry, 2022. "Measuring open innovation practices through topic modelling: Revisiting their impact on firm financial performance," Technovation, Elsevier, vol. 114(C).
    2. Lucini, Filipe R. & Tonetto, Leandro M. & Fogliatto, Flavio S. & Anzanello, Michel J., 2020. "Text mining approach to explore dimensions of airline customer satisfaction using online customer reviews," Journal of Air Transport Management, Elsevier, vol. 83(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chandra Mahapatra, Subas & Bellamkonda, Raja Shekhar, 2023. "Higher expectations of passengers do really sense: Development and validation a multiple scale-FliQual for air transport service quality," Journal of Retailing and Consumer Services, Elsevier, vol. 70(C).
    2. Cardinale, Roberto & Cardinale, Ivano & Zupic, Ivan, 2024. "The EU's vulnerability to gas price and supply shocks: The role of mismatches between policy beliefs and changing international gas markets," Energy Economics, Elsevier, vol. 131(C).
    3. Sharan Srinivas & Surya Ramachandiran, 2024. "Passenger intelligence as a competitive opportunity: unsupervised text analytics for discovering airline-specific insights from online reviews," Annals of Operations Research, Springer, vol. 333(2), pages 1045-1075, February.
    4. Nilashi, Mehrbakhsh & Abumalloh, Rabab Ali & Samad, Sarminah & Alrizq, Mesfer & Alyami, Sultan & Abosaq, Hamad & Alghamdi, Abdullah & Akib, Noor Adelyna Mohammed, 2022. "Factors impacting customer purchase intention of smart home security systems: Social data analysis using machine learning techniques," Technology in Society, Elsevier, vol. 71(C).
    5. KOUAKOU, Dorgyles C.M. & YEO, Kolotioloma I.H., 2023. "Can innovation reduce the size of the informal economy? Econometric evidence from 138 countries," MPRA Paper 119264, University Library of Munich, Germany.
    6. Kumar, Avinash & Chakraborty, Shibashish & Bala, Pradip Kumar, 2023. "Text mining approach to explore determinants of grocery mobile app satisfaction using online customer reviews," Journal of Retailing and Consumer Services, Elsevier, vol. 73(C).
    7. Murugesan, R. & A P, Rekha & N, Nitish & Balanathan, Raghavan, 2024. "Forecasting airline passengers’ satisfaction based on sentiments and ratings: An application of VADER and machine learning techniques," Journal of Air Transport Management, Elsevier, vol. 120(C).
    8. Park, Jeongeun & Yang, Donguk & Kim, Ha Young, 2023. "Text mining-based four-step framework for smart speaker product improvement and sales planning," Journal of Retailing and Consumer Services, Elsevier, vol. 71(C).
    9. Marco Guerzoni & Massimiliano Nuccio & Federico Tamagni, 2022. "Discovering pre-entry knowledge complexity with patent topic modeling and the post-entry growth of Italian firms," LEM Papers Series 2022/25, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    10. Wandelt, Sebastian & Sun, Xiaoqian & Zhang, Anming, 2023. "AI-driven assistants for education and research? A case study on ChatGPT for air transport management," Journal of Air Transport Management, Elsevier, vol. 113(C).
    11. Gang Chen & Shuaiyong Xiao & Chenghong Zhang & Huimin Zhao, 2023. "A Theory-Driven Deep Learning Method for Voice Chat–Based Customer Response Prediction," Information Systems Research, INFORMS, vol. 34(4), pages 1513-1532, December.
    12. Luanying Li & Lin Hua & Fei Gao, 2022. "What We Ask about When We Ask about Quarantine? Content and Sentiment Analysis on Online Help-Seeking Posts during COVID-19 on a Q&A Platform in China," IJERPH, MDPI, vol. 20(1), pages 1-19, December.
    13. Bellizzi, Maria Grazia & dell’Olio, Luigi & Eboli, Laura & Mazzulla, Gabriella, 2021. "Detecting passengers' heterogeneity on airlines’ services using SP data," Journal of Air Transport Management, Elsevier, vol. 96(C).
    14. Nguyen Ngoc Hien & Nguyen Thanh Long & Vo Tan Liem & Danh Xuan Luu, 2024. "Customer Responses to Airline Service Failure: Perspectives from Expectation Disconfirmation Theory," SAGE Open, , vol. 14(2), pages 21582440241, May.
    15. Keld Laursen & Ammon Salter, 2023. "What we know about open innovation, unresolved issues, and a checklist for future research," Economia e Politica Industriale: Journal of Industrial and Business Economics, Springer;Associazione Amici di Economia e Politica Industriale, vol. 50(4), pages 743-764, December.
    16. Yi, Jisu & Kim, Jongdae & Oh, Yun Kyung, 2024. "Uncovering the quality factors driving the success of mobile payment apps," Journal of Retailing and Consumer Services, Elsevier, vol. 77(C).
    17. Xie, Haotian & Li, Yi & Pu, Yang & Zhang, Chen & Huang, Junlin, 2024. "Evaluating airline service quality through a comprehensive text-mining and multi-criteria decision-making analysis," Journal of Air Transport Management, Elsevier, vol. 120(C).
    18. Levent Çallı & Büşra Alma Çallı, 2024. "Value‐centric analysis of user adoption for sustainable urban micro‐mobility transportation through shared e‐scooter services," Sustainable Development, John Wiley & Sons, Ltd., vol. 32(6), pages 6408-6433, December.
    19. Bellizzi, Maria Grazia & Eboli, Laura & Mazzulla, Gabriella & Postorino, Maria Nadia, 2022. "Classification trees for analysing highly educated people satisfaction with airlines’ services," Transport Policy, Elsevier, vol. 116(C), pages 199-211.
    20. Shanshan Wu & Long Cheng & Changcheng Huang & Yaoyao Chen, 2024. "The impact of open innovation on firms’ performance in bad times: evidence from COVID-19 pandemic," Eurasian Business Review, Springer;Eurasia Business and Economics Society, vol. 14(3), pages 657-694, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:ijsaem:v:15:y:2024:i:5:d:10.1007_s13198-023-02082-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.