IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v21y2019i4d10.1007_s10796-018-9893-0.html
   My bibliography  Save this article

Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis

Author

Listed:
  • Amir Hassan Zadeh

    (Wright State University)

  • Hamed M. Zolbanin

    (Ball State University)

  • Ramesh Sharda

    (Oklahoma State University)

  • Dursun Delen

    (Oklahoma State University)

Abstract

Contagious diseases pose significant challenges to public healthcare systems all over the world. The rise in emerging contagious and infectious diseases has led to calls for the use of new techniques and technologies capable of detecting, tracking, mapping and managing behavioral patterns in such diseases. In this study, we used Big Data technologies to analyze two sets of flu (influenza) activity data: Twitter data were used to extract behavioral patterns from a location-based social network and to monitor flu outbreaks (and their locations) in the US, and Cerner HealthFacts data warehouse was used to track real-world clinical encounters. We expected that the integration (mashing) of social media and real-world clinical encounters could be a valuable enhancement to the existing surveillance systems. Our results verified that flu-related traffic on social media is closely related with actual flu outbreaks. However, rather than using simple Pearson correlation, which assumes a zero lag between the online and real-world activities, we used a multi-method data analytics approach to obtain the spatio-temporal cross-correlation between the two flu trends and to explain behavioral patterns during the flu season. We found that clinical flu encounters lag behind online posts. Also, we identified several public locations from which a majority of posts initiated. These findings can help health authorities develop more effective interventions (behavioral and/or otherwise) during the outbreaks to reduce the spread and impact, and to inform individuals about the locations they should avoid during those periods.

Suggested Citation

  • Amir Hassan Zadeh & Hamed M. Zolbanin & Ramesh Sharda & Dursun Delen, 2019. "Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis," Information Systems Frontiers, Springer, vol. 21(4), pages 743-760, August.
  • Handle: RePEc:spr:infosf:v:21:y:2019:i:4:d:10.1007_s10796-018-9893-0
    DOI: 10.1007/s10796-018-9893-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-018-9893-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-018-9893-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yanguang Chen, 2015. "A New Methodology of Spatial Cross-Correlation Analysis," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-20, May.
    2. Chris Allen & Ming-Hsiang Tsou & Anoshe Aslam & Anna Nagel & Jean-Mark Gawron, 2016. "Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-10, July.
    3. A S Fotheringham & D W S Wong, 1991. "The Modifiable Areal Unit Problem in Multivariate Statistical Analysis," Environment and Planning A, , vol. 23(7), pages 1025-1044, July.
    4. James B. Pick & Avijit Sarkar, 2015. "United States Digital Divide," Progress in IS, in: The Global Digital Divides, edition 127, chapter 0, pages 235-274, Springer.
    5. Vanja Dukic & Hedibert F. Lopes & Nicholas G. Polson, 2012. "Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1410-1426, December.
    6. Gabriele Prati & Luca Pietrantoni & Bruna Zani, 2011. "A Social‐Cognitive Model of Pandemic Influenza H1N1 Risk Perception and Recommended Behaviors in Italy," Risk Analysis, John Wiley & Sons, vol. 31(4), pages 645-656, April.
    7. Mohler, George, 2014. "Marked point process hotspot maps for homicide and gun crime prediction in Chicago," International Journal of Forecasting, Elsevier, vol. 30(3), pages 491-497.
    8. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    9. Marta C. González & César A. Hidalgo & Albert-László Barabási, 2009. "Understanding individual human mobility patterns," Nature, Nature, vol. 458(7235), pages 238-238, March.
    10. Pick, James B. & Sarkar, Avijit & Johnson, Jeremy, 2015. "United States digital divide: State level analysis of spatial clustering and multivariate determinants of ICT utilization," Socio-Economic Planning Sciences, Elsevier, vol. 49(C), pages 16-32.
    11. Granger, C W J, 1969. "Investigating Causal Relations by Econometric Models and Cross-Spectral Methods," Econometrica, Econometric Society, vol. 37(3), pages 424-438, July.
    12. Yi-Da Chen & Susan A. Brown & Paul Jen-Hwa Hu & Chwan-Chuen King & Hsinchun Chen, 2011. "Managing Emerging Infectious Diseases with Information Systems: Reconceptualizing Outbreak Management Through the Lens of Loose Coupling," Information Systems Research, INFORMS, vol. 22(3), pages 447-468, September.
    13. David A Broniatowski & Michael J Paul & Mark Dredze, 2013. "National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-1, December.
    14. Bang Viet Nguyen & Frada Burstein & Julie Fisher, 2015. "Improving service of online health information provision: A case of usage-driven design for health information portals," Information Systems Frontiers, Springer, vol. 17(3), pages 493-511, June.
    15. Koustav Rudra & Ashish Sharma & Niloy Ganguly & Muhammad Imran, 2018. "Classifying and Summarizing Information from Microblogs During Epidemics," Information Systems Frontiers, Springer, vol. 20(5), pages 933-948, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Silvia Chiusano & Tania Cerquitelli & Robert Wrembel & Daniele Quercia, 2021. "Breakthroughs on Cross-Cutting Data Management, Data Analytics, and Applied Data Science," Information Systems Frontiers, Springer, vol. 23(1), pages 1-7, February.
    2. Prabhsimran Singh & Surleen Kaur & Abdullah M. Baabdullah & Yogesh K. Dwivedi & Sandeep Sharma & Ravinder Singh Sawhney & Ronnie Das, 2023. "Is #SDG13 Trending Online? Insights from Climate Change Discussions on Twitter," Information Systems Frontiers, Springer, vol. 25(1), pages 199-219, February.
    3. Carlos Ferreira & Alessandro Merendino & Maureen Meadows, 2023. "Disruption and Legitimacy: Big Data in Society," Information Systems Frontiers, Springer, vol. 25(3), pages 1081-1100, June.
    4. María José Aramburu & Rafael Berlanga & Indira Lanza, 2020. "Social Media Multidimensional Analysis for Intelligent Health Surveillance," IJERPH, MDPI, vol. 17(7), pages 1-17, March.
    5. Osuji E. & Evans O., 2020. "Tourism Effects of Pandemics: New Insights from Novel Coronavirus," SPOUDAI Journal of Economics and Business, SPOUDAI Journal of Economics and Business, University of Piraeus, vol. 70(3-4), pages 56-65, July-Dece.
    6. Franco Arolfo & Kevin Cortés Rodriguez & Alejandro Vaisman, 2022. "Analyzing the Quality of Twitter Data Streams," Information Systems Frontiers, Springer, vol. 24(1), pages 349-369, February.
    7. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 0. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 0, pages 1-11.
    8. Luvai Motiwalla & Amit V. Deokar & Surendra Sarnikar & Angelika Dimoka, 2019. "Leveraging Data Analytics for Behavioral Research," Information Systems Frontiers, Springer, vol. 21(4), pages 735-742, August.
    9. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 2020. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 22(5), pages 1009-1019, October.
    10. Liu, Hongfei & Jayawardhena, Chanaka & Osburg, Victoria-Sophie & Yoganathan, Vignesh & Cartwright, Severina, 2021. "Social sharing of consumption emotion in electronic word of mouth (eWOM): A cross-media perspective," Journal of Business Research, Elsevier, vol. 132(C), pages 208-220.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    2. Viktor Ivanovich Blanutsa, 2022. "Regionalization of the Digital Economic Space: Contours of Emerging Approaches," Spatial Economics=Prostranstvennaya Ekonomika, Economic Research Institute, Far Eastern Branch, Russian Academy of Sciences (Khabarovsk, Russia), issue 2, pages 56-82.
    3. Silvana Rossy Brito & Aleksandra Socorro da Silva & Eulália Carvalho Mata & Nandamudi Lankalapalli Vijaykumar & Cláudio Alex Jorge Rocha & Maurílio Abreu Monteiro & João Crisóstomo Weyl Albuquerque Co, 0. "An approach to evaluate large-scale ICT training interventions," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    4. Cedric Mbanga & Ali F. Darrat & Jung Chul Park, 2019. "Investor sentiment and aggregate stock returns: the role of investor attention," Review of Quantitative Finance and Accounting, Springer, vol. 53(2), pages 397-428, August.
    5. Long Wen & Chang Liu & Haiyan Song, 2019. "Forecasting tourism demand using search query data: A hybrid modelling approach," Tourism Economics, , vol. 25(3), pages 309-329, May.
    6. M. Hubert & P. Rousseeuw & K. Vakili, 2014. "Shape bias of robust covariance estimators: an empirical study," Statistical Papers, Springer, vol. 55(1), pages 15-28, February.
    7. Alathur, Sreejith & Vigneswara Ilavarasan, P. & Gupta, M.P., 2016. "Determinants of e-participation in the citizens and the government initiatives: Insights from India," Socio-Economic Planning Sciences, Elsevier, vol. 55(C), pages 25-35.
    8. Samuel V Scarpino & James G Scott & Rosalind M Eggo & Bruce Clements & Nedialko B Dimitrov & Lauren Ancel Meyers, 2020. "Socioeconomic bias in influenza surveillance," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-19, July.
    9. Hongying Dai & Brian R. Lee & Jianqiang Hao, 2017. "Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys," The ANNALS of the American Academy of Political and Social Science, , vol. 669(1), pages 75-92, January.
    10. Ilaria Bordino & Stefano Battiston & Guido Caldarelli & Matthieu Cristelli & Antti Ukkonen & Ingmar Weber, 2012. "Web Search Queries Can Predict Stock Market Volumes," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-17, July.
    11. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    12. Silvana Rossy Brito & Aleksandra Socorro da Silva & Eulália Carvalho Mata & Nandamudi Lankalapalli Vijaykumar & Cláudio Alex Jorge Rocha & Maurílio Abreu Monteiro & João Crisóstomo Weyl Albuquerque Co, 2018. "An approach to evaluate large-scale ICT training interventions," Information Systems Frontiers, Springer, vol. 20(4), pages 883-899, August.
    13. Jose L Herrera & Ravi Srinivasan & John S Brownstein & Alison P Galvani & Lauren Ancel Meyers, 2016. "Disease Surveillance on Complex Social Networks," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-16, July.
    14. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    15. Sarkar, Avijit & Pick, James B. & Johnson, Jeremy, 2015. "Africa's digital divide: Geography, policy, and implications," 2015 Regional ITS Conference, Los Angeles 2015 146339, International Telecommunications Society (ITS).
    16. Hilbert, Martin, 2016. "The bad news is that the digital access divide is here to stay: Domestically installed bandwidths among 172 countries for 1986–2014," Telecommunications Policy, Elsevier, vol. 40(6), pages 567-581.
    17. Valentina Lorenzoni & Gianni Andreozzi & Andrea Bazzani & Virginia Casigliani & Salvatore Pirri & Lara Tavoschi & Giuseppe Turchetti, 2022. "How Italy Tweeted about COVID-19: Detecting Reactions to the Pandemic from Social Media," IJERPH, MDPI, vol. 19(13), pages 1-14, June.
    18. Taesik Lee & Hayong Shin, 2016. "Combining syndromic surveillance and ILI data using particle filter for epidemic state estimation," Flexible Services and Manufacturing Journal, Springer, vol. 28(1), pages 233-253, June.
    19. Szeles, Monica Răileanu, 2018. "New insights from a multilevel approach to the regional digital divide in the European Union," Telecommunications Policy, Elsevier, vol. 42(6), pages 452-463.
    20. Saiz, Albert & Salazar-Miranda, Arianna, 2023. "Understanding Urban Economies, Land Use, and Social Dynamics in the City: Big Data and Measurement," IZA Discussion Papers 16501, Institute of Labor Economics (IZA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:21:y:2019:i:4:d:10.1007_s10796-018-9893-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.