IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v8y2023i11p163-d1268303.html
   My bibliography  Save this article

A Large-Scale Dataset of Search Interests Related to Disease X Originating from Different Geographic Regions

Author

Listed:
  • Nirmalya Thakur

    (Department of Computer Science, Emory University, Atlanta, GA 30322, USA)

  • Shuqi Cui

    (Department of Computer Science, Emory University, Atlanta, GA 30322, USA)

  • Kesha A. Patel

    (Department of Mathematics, Emory University, Atlanta, GA 30322, USA)

  • Isabella Hall

    (Department of Computer Science, University of Cincinnati, Cincinnati, OH 45221, USA)

  • Yuvraj Nihal Duggal

    (Department of Computer Science, Emory University, Atlanta, GA 30322, USA)

Abstract

The World Health Organization (WHO) added Disease X to their shortlist of blueprint priority diseases to represent a hypothetical, unknown pathogen that could cause a future epidemic. During different virus outbreaks of the past, such as COVID-19, Influenza, Lyme Disease, and Zika virus, researchers from various disciplines utilized Google Trends to mine multimodal components of web behavior to study, investigate, and analyze the global awareness, preparedness, and response associated with these respective virus outbreaks. As the world prepares for Disease X, a dataset on web behavior related to Disease X would be crucial to contribute towards the timely advancement of research in this field. Furthermore, none of the prior works in this field have focused on the development of a dataset to compile relevant web behavior data, which would help to prepare for Disease X. To address these research challenges, this work presents a dataset of web behavior related to Disease X, which emerged from different geographic regions of the world, between February 2018 and August 2023. Specifically, this dataset presents the search interests related to Disease X from 94 geographic regions. These regions were chosen for data mining as these regions recorded significant search interests related to Disease X during this timeframe. The dataset was developed by collecting data using Google Trends. The relevant search interests for all these regions for each month in this time range are available in this dataset. This paper also discusses the compliance of this dataset with the FAIR principles of scientific data management. Finally, an analysis of this dataset is presented to uphold the applicability, relevance, and usefulness of this dataset for the investigation of different research questions in the interrelated fields of Big Data, Data Mining, Healthcare, Epidemiology, and Data Analysis with a specific focus on Disease X.

Suggested Citation

  • Nirmalya Thakur & Shuqi Cui & Kesha A. Patel & Isabella Hall & Yuvraj Nihal Duggal, 2023. "A Large-Scale Dataset of Search Interests Related to Disease X Originating from Different Geographic Regions," Data, MDPI, vol. 8(11), pages 1-24, October.
  • Handle: RePEc:gam:jdataj:v:8:y:2023:i:11:p:163-:d:1268303
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/8/11/163/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/8/11/163/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Dorsa Mohammadi Arezooji, 2021. "A Big Data Analysis of the Ethereum Network: from Blockchain to Google Trends," Papers 2104.01764, arXiv.org.
    2. Kate E. Jones & Nikkita G. Patel & Marc A. Levy & Adam Storeygard & Deborah Balk & John L. Gittleman & Peter Daszak, 2008. "Global trends in emerging infectious diseases," Nature, Nature, vol. 451(7181), pages 990-993, February.
    3. Arnstein Aassve & Guido Alfani & Francesco Gandolfi & Marco Le Moglie, 2021. "Epidemics and trust: The case of the Spanish Flu," Health Economics, John Wiley & Sons, Ltd., vol. 30(4), pages 840-857, April.
    4. Nirmalya Thakur & Chia Y. Han, 2021. "Country-Specific Interests towards Fall Detection from 2004–2021: An Open Access Dataset and Research Questions," Data, MDPI, vol. 6(8), pages 1-21, August.
    5. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    6. Rodrigo Mulero & Alfredo García-Hiernaux, 2021. "Forecasting Spanish unemployment with Google Trends and dimension reduction techniques," SERIEs: Journal of the Spanish Economic Association, Springer;Spanish Economic Association, vol. 12(3), pages 329-349, September.
    7. Jalan, Akanksha & Matkovskyy, Roman & Urquhart, Andrew & Yarovaya, Larisa, 2023. "The role of interpersonal trust in cryptocurrency adoption," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 83(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    2. Qiong Jia & Yue Guo & Guanlin Wang & Stuart J. Barnes, 2020. "Big Data Analytics in the Fight against Major Public Health Incidents (Including COVID-19): A Conceptual Framework," IJERPH, MDPI, vol. 17(17), pages 1-21, August.
    3. Nikolett Orosz & Tünde Tóthné Tóth & Gyöngyi Vargáné Gyuró & Zsoltné Tibor Nábrádi & Klára Hegedűsné Sorosi & Zsuzsa Nagy & Éva Rigó & Ádám Kaposi & Gabriella Gömöri & Cornelia Melinda Adi Santoso & A, 2022. "Comparison of Length of Hospital Stay for Community-Acquired Infections Due to Enteric Pathogens, Influenza Viruses and Multidrug-Resistant Bacteria: A Cross-Sectional Study in Hungary," IJERPH, MDPI, vol. 19(23), pages 1-16, November.
    4. Eichengreen, Barry & Aksoy, Cevat Giray & Saka, Orkun, 2021. "Revenge of the experts: Will COVID-19 renew or diminish public trust in science?," Journal of Public Economics, Elsevier, vol. 193(C).
    5. Mudassar Arsalan & Omar Mubin & Fady Alnajjar & Belal Alsinglawi, 2020. "COVID-19 Global Risk: Expectation vs. Reality," IJERPH, MDPI, vol. 17(15), pages 1-10, August.
    6. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    7. Xiaoli Wang & Shuangsheng Wu & C Raina MacIntyre & Hongbin Zhang & Weixian Shi & Xiaomin Peng & Wei Duan & Peng Yang & Yi Zhang & Quanyi Wang, 2015. "Using an Adjusted Serfling Regression Model to Improve the Early Warning at the Arrival of Peak Timing of Influenza in Beijing," PLOS ONE, Public Library of Science, vol. 10(3), pages 1-14, March.
    8. Ishani Chaudhuri & Parthajit Kayal, 2022. "Predicting Power of Ticker Search Volume in Indian Stock Market," Working Papers 2022-214, Madras School of Economics,Chennai,India.
    9. Yang, Xin & Pan, Bing & Evans, James A. & Lv, Benfu, 2015. "Forecasting Chinese tourist volume with search engine data," Tourism Management, Elsevier, vol. 46(C), pages 386-397.
    10. Kuchler, Theresa & Russel, Dominic & Stroebel, Johannes, 2022. "JUE Insight: The geographic spread of COVID-19 correlates with the structure of social networks as measured by Facebook," Journal of Urban Economics, Elsevier, vol. 127(C).
    11. Ceddia, M.G. & Bardsley, N.O. & Goodwin, R. & Holloway, G.J. & Nocella, G. & Stasi, A., 2013. "A complex system perspective on the emergence and spread of infectious diseases: Integrating economic and ecological aspects," Ecological Economics, Elsevier, vol. 90(C), pages 124-131.
    12. Markowitz, Sara & Nesson, Erik & Robinson, Joshua J., 2019. "The effects of employment on influenza rates," Economics & Human Biology, Elsevier, vol. 34(C), pages 286-295.
    13. Bentzen, Jeanet Sinding, 2021. "In crisis, we pray: Religiosity and the COVID-19 pandemic," Journal of Economic Behavior & Organization, Elsevier, vol. 192(C), pages 541-583.
    14. John M Drake & Tobias S Brett & Shiyang Chen & Bogdan I Epureanu & Matthew J Ferrari & Éric Marty & Paige B Miller & Eamon B O’Dea & Suzanne M O’Regan & Andrew W Park & Pejman Rohani, 2019. "The statistics of epidemic transitions," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-14, May.
    15. Ongolo, Symphorien & Giessen, Lukas & Karsenty, Alain & Tchamba, Martin & Krott, Max, 2021. "Forestland policies and politics in Africa: Recent evidence and new challenges," Forest Policy and Economics, Elsevier, vol. 127(C).
    16. Jesse T. Richman & Ryan J. Roberts, 2023. "Assessing Spurious Correlations in Big Search Data," Forecasting, MDPI, vol. 5(1), pages 1-12, February.
    17. Linus Schiöler & Marianne Fris�n, 2012. "Multivariate outbreak detection," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(2), pages 223-242, April.
    18. Sasikiran Kandula & Jeffrey Shaman, 2019. "Reappraising the utility of Google Flu Trends," PLOS Computational Biology, Public Library of Science, vol. 15(8), pages 1-16, August.
    19. Paige, Sarah B. & Malavé, Carly & Mbabazi, Edith & Mayer, Jonathan & Goldberg, Tony L., 2015. "Uncovering zoonoses awareness in an emerging disease ‘hotspot’," Social Science & Medicine, Elsevier, vol. 129(C), pages 78-86.
    20. Jianhua Wang & Guan-Zhu Han, 2023. "Genome mining shows that retroviruses are pervasively invading vertebrate genomes," Nature Communications, Nature, vol. 14(1), pages 1-11, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:8:y:2023:i:11:p:163-:d:1268303. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.