Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i9p5126-d800096.html

Fine-Tuning BERT Models to Classify Misinformation on Garlic and COVID-19 on Twitter

Author

Listed:
  • Myeong Gyu Kim

    (College of Pharmacy, Graduate School of Pharmaceutical Sciences, Ewha Womans University, Seoul 03760, Korea)

  • Minjung Kim

    (College of Pharmacy, Yonsei University, Incheon 21983, Korea)

  • Jae Hyun Kim

    (School of Pharmacy, Jeonbuk National University, Jeonju 54896, Korea)

  • Kyungim Kim

    (College of Pharmacy, Korea University, Sejong 30019, Korea)

Abstract

Garlic-related misinformation is prevalent whenever a virus outbreak occurs. With the outbreak of COVID-19, garlic-related misinformation is spreading through social media, including Twitter. Bidirectional Encoder Representations from Transformers (BERT) can be used to classify misinformation from a vast number of tweets. This study aimed to apply the BERT model for classifying misinformation on garlic and COVID-19 on Twitter, using 5929 original tweets mentioning garlic and COVID-19 (4151 for fine-tuning, 1778 for testing). Tweets were manually labeled as ‘misinformation’ or ‘other.’ We fine-tuned five BERT models (BERT BASE, BERT LARGE, BERTweet-base, BERTweet-COVID-19, and BERTweet-large) using either a general COVID-19 rumor dataset or a garlic-specific dataset. Accuracy and F1 score were calculated to evaluate the performance of the models. The BERT models fine-tuned with the COVID-19 rumor dataset showed poor performance, with a maximum accuracy of 0.647. BERT models fine-tuned with the garlic-specific dataset showed better performance. BERTweet models achieved accuracy of 0.897–0.911, while BERT BASE and BERT LARGE achieved accuracy of 0.887–0.897. BERTweet-large showed the best performance, with a maximum accuracy of 0.911 and an F1 score of 0.894. Thus, BERT models fine-tuned with the garlic-specific dataset showed good performance in classifying misinformation. The results of our study will help detect misinformation related to garlic and COVID-19 on Twitter.
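
The two evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `accuracy_and_f1`, the toy labels, and the choice to compute F1 for the 'misinformation' class (rather than, say, a macro average) are assumptions made for illustration, since the abstract does not state how F1 was averaged. The fine-tuning/test split quoted above is consistent with the total: 4151 + 1778 = 5929.

```python
from typing import Sequence, Tuple


def accuracy_and_f1(
    y_true: Sequence[str],
    y_pred: Sequence[str],
    positive: str = "misinformation",  # assumption: F1 is reported for this class
) -> Tuple[float, float]:
    """Return (accuracy, F1 for the positive class) for two label sequences."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Accuracy: share of tweets whose predicted label matches the manual label.
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)

    # F1: harmonic mean of precision and recall; defined as 0.0 when undefined.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return accuracy, f1


# Toy labels (hypothetical, not from the study's dataset).
manual = ["misinformation", "misinformation", "other", "other", "misinformation", "other"]
model = ["misinformation", "other", "other", "other", "misinformation", "misinformation"]
acc, f1 = accuracy_and_f1(manual, model)
print(f"accuracy={acc:.3f}, F1={f1:.3f}")
```

Reporting both numbers matters when the classes are unbalanced: accuracy alone can look high if a model mostly predicts the majority label, whereas F1 penalizes missed or spurious 'misinformation' predictions.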

Suggested Citation

  • Myeong Gyu Kim & Minjung Kim & Jae Hyun Kim & Kyungim Kim, 2022. "Fine-Tuning BERT Models to Classify Misinformation on Garlic and COVID-19 on Twitter," IJERPH, MDPI, vol. 19(9), pages 1-9, April.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:9:p:5126-:d:800096

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/9/5126/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/9/5126/
    Download Restriction: no