IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i20p13248-d942254.html
   My bibliography  Save this article

Revealing Public Opinion towards the COVID-19 Vaccine with Weibo Data in China: BertFDA-Based Model

Author

Listed:
  • Jianping Zhu

    (National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China
    Data Mining Research Center, Xiamen University, Xiamen 361005, China
    School of Management, Xiamen University, Xiamen 361005, China)

  • Futian Weng

    (National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China
    Data Mining Research Center, Xiamen University, Xiamen 361005, China
    School of Medicine, Xiamen University, Xiamen 361005, China)

  • Muni Zhuang

    (National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China
    Data Mining Research Center, Xiamen University, Xiamen 361005, China
    School of Medicine, Xiamen University, Xiamen 361005, China)

  • Xin Lu

    (College of Systems Engineering, National University of Defense Technology, Changsha 410073, China)

  • Xu Tan

    (Career-Oriented Multidisciplinary Education Center, Shenzhen Institiute of Information Technology, Shenzhen 518172, China)

  • Songjie Lin

    (Career-Oriented Multidisciplinary Education Center, Shenzhen Institiute of Information Technology, Shenzhen 518172, China)

  • Ruoyi Zhang

    (Columbia College of Art and Science, George Washington University, Washington, DC 20052, USA)

Abstract

The COVID-19 pandemic has created unprecedented burdens on people’s health and subjective well-being. While countries around the world have established models to track and predict the affective states of COVID-19, identifying the topics of public discussion and sentiment evolution of the vaccine, particularly the differences in topics of concern between vaccine-support and vaccine-hesitant groups, remains scarce. Using social media data from the two years following the outbreak of COVID-19 (23 January 2020 to 23 January 2022), coupled with state-of-the-art natural language processing (NLP) techniques, we developed a public opinion analysis framework (BertFDA). First, using dynamic topic clustering on Weibo through the latent Dirichlet allocation (LDA) model, a total of 118 topics were generated in 24 months using 2,211,806 microblog posts. Second, by building an improved Bert pre-training model for sentiment classification, we provide evidence that public negative sentiment continued to decline in the early stages of COVID-19 vaccination. Third, by modeling and analyzing the microblog posts from the vaccine-support group and the vaccine-hesitant group, we discover that the vaccine-support group was more concerned about vaccine effectiveness and the reporting of news, reflecting greater group cohesion, whereas the vaccine-hesitant group was particularly concerned about the spread of coronavirus variants and vaccine side effects. Finally, we deployed different machine learning models to predict public opinion. Moreover, functional data analysis (FDA) is developed to build the functional sentiment curve, which can effectively capture the dynamic changes with the explicit function. This study can aid governments in developing effective interventions and education campaigns to boost vaccination rates.

Suggested Citation

  • Jianping Zhu & Futian Weng & Muni Zhuang & Xin Lu & Xu Tan & Songjie Lin & Ruoyi Zhang, 2022. "Revealing Public Opinion towards the COVID-19 Vaccine with Weibo Data in China: BertFDA-Based Model," IJERPH, MDPI, vol. 19(20), pages 1-26, October.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13248-:d:942254
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/20/13248/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/20/13248/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Heidi Ledford, 2021. "How could a COVID vaccine cause blood clots? Scientists race to investigate," Nature, Nature, vol. 592(7854), pages 334-335, April.
    2. Weng, Futian & Zhang, Hongwei & Yang, Cai, 2021. "Volatility forecasting of crude oil futures based on a genetic algorithm regularization online extreme learning machine with a forgetting factor: The role of news during the COVID-19 pandemic," Resources Policy, Elsevier, vol. 73(C).
    3. Jose J Padilla & Hamdi Kavak & Christopher J Lynch & Ross J Gore & Saikou Y Diallo, 2018. "Temporal and spatiotemporal investigation of tourist attraction visit sentiment on Twitter," PLOS ONE, Public Library of Science, vol. 13(6), pages 1-20, June.
    4. Zhenjie Liang & Futian Weng & Yuanting Ma & Yan Xu & Miao Zhu & Cai Yang, 2022. "Measurement and Analysis of High Frequency Assert Volatility Based on Functional Data Analysis," Mathematics, MDPI, vol. 10(7), pages 1-11, April.
    5. Sijia Li & Yilin Wang & Jia Xue & Nan Zhao & Tingshao Zhu, 2020. "The Impact of COVID-19 Epidemic Declaration on Psychological Consequences: A Study on Active Weibo Users," IJERPH, MDPI, vol. 17(6), pages 1-9, March.
    6. J. Ramsay, 1982. "When the data are functions," Psychometrika, Springer;The Psychometric Society, vol. 47(4), pages 379-396, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thanh Bui & Andrea Hannah & Sanjay Madria & Rosemary Nabaweesi & Eugene Levin & Michael Wilson & Long Nguyen, 2023. "Emotional Health and Climate-Change-Related Stressor Extraction from Social Media: A Case Study Using Hurricane Harvey," Mathematics, MDPI, vol. 11(24), pages 1-16, December.
    2. Jian Shi & Hanxiao Wang, 2023. "Examining the Intermedia Agenda Setting Effects amid the Changsheng Vaccine Crisis: A Computational Approach," IJERPH, MDPI, vol. 20(5), pages 1-12, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhenjie Liang & Futian Weng & Yuanting Ma & Yan Xu & Miao Zhu & Cai Yang, 2022. "Measurement and Analysis of High Frequency Assert Volatility Based on Functional Data Analysis," Mathematics, MDPI, vol. 10(7), pages 1-11, April.
    2. Israel Escudero-Castillo & Fco. Javier Mato-Díaz & Ana Rodriguez-Alvarez, 2021. "Furloughs, Teleworking and Other Work Situations during the COVID-19 Lockdown: Impact on Mental Well-Being," IJERPH, MDPI, vol. 18(6), pages 1-16, March.
    3. S. Brent Jackson & Kathryn T. Stevenson & Lincoln R. Larson & M. Nils Peterson & Erin Seekamp, 2021. "Outdoor Activity Participation Improves Adolescents’ Mental Health and Well-Being during the COVID-19 Pandemic," IJERPH, MDPI, vol. 18(5), pages 1-18, March.
    4. Massimiliano Scopelliti & Maria Giuseppina Pacilli & Antonio Aquino, 2021. "TV News and COVID-19: Media Influence on Healthy Behavior in Public Spaces," IJERPH, MDPI, vol. 18(4), pages 1-15, February.
    5. Yuan Zheng & Jingyi Zhou & Xianglong Zeng & Mingyan Jiang & Tian P. S. Oei, 2022. "A New Second-Generation Mindfulness-Based Intervention Focusing on Well-Being: A Randomized Control Trial of Mindfulness-Based Positive Psychology," Journal of Happiness Studies, Springer, vol. 23(6), pages 2703-2724, August.
    6. Cristhian Leonardo Urbano-Leon & Manuel Escabias & Diana Paola Ovalle-Muñoz & Javier Olaya-Ochoa, 2023. "Scalar Variance and Scalar Correlation for Functional Data," Mathematics, MDPI, vol. 11(6), pages 1-20, March.
    7. Clemens Koestner & Viktoria Eggert & Theresa Dicks & Kristin Kalo & Carolina Zähme & Pavel Dietz & Stephan Letzel & Till Beutel, 2022. "Psychological Burdens among Teachers in Germany during the SARS-CoV-2 Pandemic—Subgroup Analysis from a Nationwide Cross-Sectional Online Survey," IJERPH, MDPI, vol. 19(15), pages 1-16, August.
    8. Yong Gao & Yuanyuan Chen & Lan Mu & Shize Gong & Pengcheng Zhang & Yu Liu, 2022. "Measuring urban sentiments from social media data: a dual-polarity metric approach," Journal of Geographical Systems, Springer, vol. 24(2), pages 199-221, April.
    9. Durmuş Burak, 2023. "The Effect of Risk and Protective Factors on Primary School Students’ COVID-19 Anxiety: Back to School After the Pandemic," Child Indicators Research, Springer;The International Society of Child Indicators (ISCI), vol. 16(1), pages 29-51, February.
    10. Maraseni, Tek & Poudyal, Bishnu Hari & Aryal, Kishor & Laudari, Hari Krishna, 2022. "Impact of COVID-19 in the forestry sector: A case of lowland region of Nepal," Land Use Policy, Elsevier, vol. 120(C).
    11. Ilse Adriana Gutiérrez-Pérez & Pedro Delgado-Floody & Daniel Jerez-Mayorga & Diego Soto-García & Felipe Caamaño-Navarrete & Isela Parra-Rojas & Nacim Molina-Gutiérrez & Iris Paola Guzmán-Guzmán, 2021. "Lifestyle and Sociodemographic Parameters Associated with Mental and Physical Health during COVID-19 Confinement in Three Ibero-American Countries. A Cross-Sectional Pilot Study," IJERPH, MDPI, vol. 18(10), pages 1-13, May.
    12. Philippe Besse & J. Ramsay, 1986. "Principal components analysis of sampled functions," Psychometrika, Springer;The Psychometric Society, vol. 51(2), pages 285-311, June.
    13. Christian M. Hafner, 2020. "The Spread of the Covid-19 Pandemic in Time and Space," IJERPH, MDPI, vol. 17(11), pages 1-13, May.
    14. Siqi Lai & Brian Deal, 2022. "Parks, Green Space, and Happiness: A Spatially Specific Sentiment Analysis Using Microblogs in Shanghai, China," Sustainability, MDPI, vol. 15(1), pages 1-18, December.
    15. Jadsada Kunno & Busaba Supawattanabodee & Chavanant Sumanasrethakul & Chuthamat Kaewchandee & Wachiraporn Wanichnopparat & Krit Prasittichok, 2022. "The Relationship between Attitudes and Satisfaction Concerning the COVID-19 Vaccine and Vaccine Boosters in Urban Bangkok, Thailand: A Cross-Sectional Study," IJERPH, MDPI, vol. 19(9), pages 1-11, April.
    16. Mateusz Ciski & Krzysztof Rząsa, 2023. "Multiscale Geographically Weighted Regression in the Investigation of Local COVID-19 Anomalies Based on Population Age Structure in Poland," IJERPH, MDPI, vol. 20(10), pages 1-23, May.
    17. Kalogridis, Ioannis & Van Aelst, Stefan, 2023. "Robust penalized estimators for functional linear regression," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    18. Yang Yang & Keqiao Liu & Siqi Li & Man Shu, 2020. "Social Media Activities, Emotion Regulation Strategies, and Their Interactions on People’s Mental Health in COVID-19 Pandemic," IJERPH, MDPI, vol. 17(23), pages 1-16, December.
    19. Alessandro Germani & Livia Buratta & Elisa Delvecchio & Claudia Mazzeschi, 2020. "Emerging Adults and COVID-19: The Role of Individualism-Collectivism on Perceived Risks and Psychological Maladjustment," IJERPH, MDPI, vol. 17(10), pages 1-15, May.
    20. Jacqueline-Nathalie Harba & Gabriela Tigu & Adriana AnaMaria Davidescu, 2021. "Exploring Consumer Emotions in Pre-Pandemic and Pandemic Times. A Sentiment Analysis of Perceptions in the Fine-Dining Restaurant Industry in Bucharest, Romania," IJERPH, MDPI, vol. 18(24), pages 1-24, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:20:p:13248-:d:942254. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.