IDEAS home Printed from https://ideas.repec.org/a/spr/jcsosc/v6y2023i2d10.1007_s42001-023-00200-3.html
   My bibliography  Save this article

COCO: an annotated Twitter dataset of COVID-19 conspiracy theories

Author

Listed:
  • Johannes Langguth

    (Simula Research Lab
    Norwegian Business School)

  • Daniel Thilo Schroeder

    (Simula Research Lab
    Oslo Metropolitan University)

  • Petra Filkuková

    (Simula Research Lab)

  • Stefan Brenner

    (Stuttgart Media University)

  • Jesper Phillips

    (Bates College)

  • Konstantin Pogorelov

    (Simula Research Lab)

Abstract

The COVID-19 pandemic has been accompanied by a surge of misinformation on social media which covered a wide range of different topics and contained many competing narratives, including conspiracy theories. To study such conspiracy theories, we created a dataset of 3495 tweets with manual labeling of the stance of each tweet w.r.t. 12 different conspiracy topics. The dataset thus contains almost 42,000 labels, each of which determined by majority among three expert annotators. The dataset was selected from COVID-19 related Twitter data spanning from January 2020 to June 2021 using a list of 54 keywords. The dataset can be used to train machine learning based classifiers for both stance and topic detection, either individually or simultaneously. BERT was used successfully for the combined task. The dataset can also be used to further study the prevalence of different conspiracy narratives. To this end we qualitatively analyze the tweets, discussing the structure of conspiracy narratives that are frequently found in the dataset. Furthermore, we illustrate the interconnection between the conspiracy categories as well as the keywords.

Suggested Citation

  • Johannes Langguth & Daniel Thilo Schroeder & Petra Filkuková & Stefan Brenner & Jesper Phillips & Konstantin Pogorelov, 2023. "COCO: an annotated Twitter dataset of COVID-19 conspiracy theories," Journal of Computational Social Science, Springer, vol. 6(2), pages 443-484, October.
  • Handle: RePEc:spr:jcsosc:v:6:y:2023:i:2:d:10.1007_s42001-023-00200-3
    DOI: 10.1007/s42001-023-00200-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s42001-023-00200-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s42001-023-00200-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Thomas Hale & Noam Angrist & Rafael Goldszmidt & Beatriz Kira & Anna Petherick & Toby Phillips & Samuel Webster & Emily Cameron-Blake & Laura Hallas & Saptarshi Majumdar & Helen Tatlow, 2021. "A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker)," Nature Human Behaviour, Nature, vol. 5(4), pages 529-538, April.
    2. Gordon Pennycook & Ziv Epstein & Mohsen Mosleh & Antonio A. Arechar & Dean Eckles & David G. Rand, 2021. "Shifting attention to accuracy can reduce misinformation online," Nature, Nature, vol. 592(7855), pages 590-595, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hannah Carver & Tracey Price & Danilo Falzon & Peter McCulloch & Tessa Parkes, 2022. "Stress and Wellbeing during the COVID-19 Pandemic: A Mixed-Methods Exploration of Frontline Homelessness Services Staff Experiences in Scotland," IJERPH, MDPI, vol. 19(6), pages 1-20, March.
    2. Xiao Chen & Hanwei Huang & Jiandong Ju & Ruoyan Sun & Jialiang Zhang, 2022. "Endogenous cross-region human mobility and pandemics," CEP Discussion Papers dp1860, Centre for Economic Performance, LSE.
    3. Yekaterina Chzhen & Jennifer Symonds & Dympna Devine & Júlia Mikolai & Susan Harkness & Seaneen Sloan & Gabriela Martinez Sainz, 2022. "Learning in a Pandemic: Primary School children’s Emotional Engagement with Remote Schooling during the spring 2020 Covid-19 Lockdown in Ireland," Child Indicators Research, Springer;The International Society of Child Indicators (ISCI), vol. 15(4), pages 1517-1538, August.
    4. Nicolás Ajzenman & Bruno Ferman & Sant’Anna Pedro C., 2023. "Discrimination in the Formation of Academic Networks: A Field Experiment on #EconTwitter," Working Papers 235, Red Nacional de Investigadores en Economía (RedNIE).
    5. Mirko Licchetta & Giovanni Mattozzi & Rafal Raciborski & Rupert Willis, 2022. "Economic Adjustment in the Euro Area and the United States during the COVID-19 Crisis," European Economy - Discussion Papers 160, Directorate General Economic and Financial Affairs (DG ECFIN), European Commission.
    6. Buechel, Berno & Klößner, Stefan & Meng, Fanyuan & Nassar, Anis, 2023. "Misinformation due to asymmetric information sharing," Journal of Economic Dynamics and Control, Elsevier, vol. 150(C).
    7. Lucia Freira & Marco Sartorio & Cynthia Boruchowicz & Florencia Lopez Boo & Joaquin Navajas, 2021. "The interplay between partisanship, forecasted COVID-19 deaths, and support for preventive policies," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-10, December.
    8. Galil, Koresh & Varon, Eva, 2024. "National culture and banks stock volatility," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 91(C).
    9. Hammond, James & Siegal, Kim & Milner, Daniel & Elimu, Emmanuel & Vail, Taylor & Cathala, Paul & Gatera, Arsene & Karim, Azfar & Lee, Ja-Eun & Douxchamps, Sabine & Tu, Mai Thanh & Ouma, Emily & Lukuyu, 2022. "Perceived effects of COVID-19 restrictions on smallholder farmers: Evidence from seven lower- and middle-income countries," Agricultural Systems, Elsevier, vol. 198(C).
    10. Beckles, Jamila & Jackman, Mahalia, 2024. "Financial worry and government responses to the COVID-19 pandemic in 88 Countries: Did public confidence in National Governments matter?," Journal of Behavioral and Experimental Finance, Elsevier, vol. 43(C).
    11. Joseph B. Bak-Coleman & Ian Kennedy & Morgan Wack & Andrew Beers & Joseph S. Schafer & Emma S. Spiro & Kate Starbird & Jevin D. West, 2022. "Combining interventions to reduce the spread of viral misinformation," Nature Human Behaviour, Nature, vol. 6(10), pages 1372-1380, October.
    12. Winfree, Paul, 2023. "The long-run effects of temporarily closing schools: Evidence from Virginia, 1870s-1910s," QUCEH Working Paper Series 23-02, Queen's University Belfast, Queen's University Centre for Economic History.
    13. Xu,Yuanwei & Delius,Antonia Johanna Sophie & Pape,Utz Johann, 2022. "Gender Differences in Household Coping Strategies for COVID-19 in Kenya," Policy Research Working Paper Series 9959, The World Bank.
    14. Christoph Lindner & Ibolya Kotta & Eszter Eniko Marschalko & Kinga Szabo & Kinga Kalcza-Janosi & Jan Retelsdorf, 2022. "Increased Risk Perception, Distress Intolerance and Health Anxiety in Stricter Lockdowns: Self-Control as a Key Protective Factor in Early Response to the COVID-19 Pandemic," IJERPH, MDPI, vol. 19(9), pages 1-22, April.
    15. Elmarie Nel & Andrew MacLachlan & Ollie Ballinger & Hugh Cole & Megan Cole, 2023. "Data-Driven Decision Making in Response to the COVID-19 Pandemic: A City of Cape Town Case Study," Sustainability, MDPI, vol. 15(3), pages 1-18, January.
    16. Juan Grigera, 2022. "Adding Insult to Injury: The COVID‐19 Crisis Strikes Latin America," Development and Change, International Institute of Social Studies, vol. 53(6), pages 1335-1361, November.
    17. Valentina Aprigliano & Alessandro Borin & Francesco Paolo Conteduca & Simone Emiliozzi & Marco Flaccadoro & Sabina Marchetti & Stefania Villa, 2021. "Forecasting Italian GDP growth with epidemiological data," Questioni di Economia e Finanza (Occasional Papers) 664, Bank of Italy, Economic Research and International Relations Area.
    18. Maximilien Chaumon & Pier-Alexandre Rioux & Sophie K. Herbst & Ignacio Spiousas & Sebastian L. Kübel & Elisa M. Gallego Hiroyasu & Şerife Leman Runyun & Luigi Micillo & Vassilis Thanopoulos & Esteban , 2022. "The Blursday database as a resource to study subjective temporalities during COVID-19," Nature Human Behaviour, Nature, vol. 6(11), pages 1587-1599, November.
    19. Xuhao Shao & Ao Li & Chuansheng Chen & Elizabeth F. Loftus & Bi Zhu, 2023. "Cross-stage neural pattern similarity in the hippocampus predicts false memory derived from post-event inaccurate information," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    20. Daryna Grechyna, 2024. "Elections and policies: Evidence from the Covid pandemic," Kyklos, Wiley Blackwell, vol. 77(3), pages 812-831, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jcsosc:v:6:y:2023:i:2:d:10.1007_s42001-023-00200-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.