IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v21y2019i4d10.1007_s10796-018-9893-0.html
   My bibliography  Save this article

Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis

Author

Listed:
  • Amir Hassan Zadeh

    (Wright State University)

  • Hamed M. Zolbanin

    (Ball State University)

  • Ramesh Sharda

    (Oklahoma State University)

  • Dursun Delen

    (Oklahoma State University)

Abstract

Contagious diseases pose significant challenges to public healthcare systems all over the world. The rise in emerging contagious and infectious diseases has led to calls for the use of new techniques and technologies capable of detecting, tracking, mapping and managing behavioral patterns in such diseases. In this study, we used Big Data technologies to analyze two sets of flu (influenza) activity data: Twitter data were used to extract behavioral patterns from a location-based social network and to monitor flu outbreaks (and their locations) in the US, and Cerner HealthFacts data warehouse was used to track real-world clinical encounters. We expected that the integration (mashing) of social media and real-world clinical encounters could be a valuable enhancement to the existing surveillance systems. Our results verified that flu-related traffic on social media is closely related with actual flu outbreaks. However, rather than using simple Pearson correlation, which assumes a zero lag between the online and real-world activities, we used a multi-method data analytics approach to obtain the spatio-temporal cross-correlation between the two flu trends and to explain behavioral patterns during the flu season. We found that clinical flu encounters lag behind online posts. Also, we identified several public locations from which a majority of posts initiated. These findings can help health authorities develop more effective interventions (behavioral and/or otherwise) during the outbreaks to reduce the spread and impact, and to inform individuals about the locations they should avoid during those periods.

Suggested Citation

  • Amir Hassan Zadeh & Hamed M. Zolbanin & Ramesh Sharda & Dursun Delen, 2019. "Social Media for Nowcasting Flu Activity: Spatio-Temporal Big Data Analysis," Information Systems Frontiers, Springer, vol. 21(4), pages 743-760, August.
  • Handle: RePEc:spr:infosf:v:21:y:2019:i:4:d:10.1007_s10796-018-9893-0
    DOI: 10.1007/s10796-018-9893-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-018-9893-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-018-9893-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yanguang Chen, 2015. "A New Methodology of Spatial Cross-Correlation Analysis," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-20, May.
    2. A S Fotheringham & D W S Wong, 1991. "The Modifiable Areal Unit Problem in Multivariate Statistical Analysis," Environment and Planning A, , vol. 23(7), pages 1025-1044, July.
    3. Granger, C W J, 1969. "Investigating Causal Relations by Econometric Models and Cross-Spectral Methods," Econometrica, Econometric Society, vol. 37(3), pages 424-438, July.
    4. Koustav Rudra & Ashish Sharma & Niloy Ganguly & Muhammad Imran, 2018. "Classifying and Summarizing Information from Microblogs During Epidemics," Information Systems Frontiers, Springer, vol. 20(5), pages 933-948, October.
    5. Chris Allen & Ming-Hsiang Tsou & Anoshe Aslam & Anna Nagel & Jean-Mark Gawron, 2016. "Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza," PLOS ONE, Public Library of Science, vol. 11(7), pages 1-10, July.
    6. James B. Pick & Avijit Sarkar, 2015. "United States Digital Divide," Progress in IS, in: The Global Digital Divides, edition 127, chapter 0, pages 235-274, Springer.
    7. Vanja Dukic & Hedibert F. Lopes & Nicholas G. Polson, 2012. "Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(500), pages 1410-1426, December.
    8. Gabriele Prati & Luca Pietrantoni & Bruna Zani, 2011. "A Social‐Cognitive Model of Pandemic Influenza H1N1 Risk Perception and Recommended Behaviors in Italy," Risk Analysis, John Wiley & Sons, vol. 31(4), pages 645-656, April.
    9. Mohler, George, 2014. "Marked point process hotspot maps for homicide and gun crime prediction in Chicago," International Journal of Forecasting, Elsevier, vol. 30(3), pages 491-497.
    10. Jeremy Ginsberg & Matthew H. Mohebbi & Rajan S. Patel & Lynnette Brammer & Mark S. Smolinski & Larry Brilliant, 2009. "Detecting influenza epidemics using search engine query data," Nature, Nature, vol. 457(7232), pages 1012-1014, February.
    11. Marta C. González & César A. Hidalgo & Albert-László Barabási, 2009. "Understanding individual human mobility patterns," Nature, Nature, vol. 458(7235), pages 238-238, March.
    12. Pick, James B. & Sarkar, Avijit & Johnson, Jeremy, 2015. "United States digital divide: State level analysis of spatial clustering and multivariate determinants of ICT utilization," Socio-Economic Planning Sciences, Elsevier, vol. 49(C), pages 16-32.
    13. Yi-Da Chen & Susan A. Brown & Paul Jen-Hwa Hu & Chwan-Chuen King & Hsinchun Chen, 2011. "Managing Emerging Infectious Diseases with Information Systems: Reconceptualizing Outbreak Management Through the Lens of Loose Coupling," Information Systems Research, INFORMS, vol. 22(3), pages 447-468, September.
    14. David A Broniatowski & Michael J Paul & Mark Dredze, 2013. "National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic," PLOS ONE, Public Library of Science, vol. 8(12), pages 1-1, December.
    15. Bang Viet Nguyen & Frada Burstein & Julie Fisher, 2015. "Improving service of online health information provision: A case of usage-driven design for health information portals," Information Systems Frontiers, Springer, vol. 17(3), pages 493-511, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Prabhsimran Singh & Surleen Kaur & Abdullah M. Baabdullah & Yogesh K. Dwivedi & Sandeep Sharma & Ravinder Singh Sawhney & Ronnie Das, 2023. "Is #SDG13 Trending Online? Insights from Climate Change Discussions on Twitter," Information Systems Frontiers, Springer, vol. 25(1), pages 199-219, February.
    2. María José Aramburu & Rafael Berlanga & Indira Lanza, 2020. "Social Media Multidimensional Analysis for Intelligent Health Surveillance," IJERPH, MDPI, vol. 17(7), pages 1-17, March.
    3. Osuji E. & Evans O., 2020. "Tourism Effects of Pandemics: New Insights from Novel Coronavirus," SPOUDAI Journal of Economics and Business, SPOUDAI Journal of Economics and Business, University of Piraeus, vol. 70(3-4), pages 56-65, July-Dece.
    4. Franco Arolfo & Kevin Cortés Rodriguez & Alejandro Vaisman, 2022. "Analyzing the Quality of Twitter Data Streams," Information Systems Frontiers, Springer, vol. 24(1), pages 349-369, February.
    5. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 2020. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 22(5), pages 1009-1019, October.
    6. Liu, Hongfei & Jayawardhena, Chanaka & Osburg, Victoria-Sophie & Yoganathan, Vignesh & Cartwright, Severina, 2021. "Social sharing of consumption emotion in electronic word of mouth (eWOM): A cross-media perspective," Journal of Business Research, Elsevier, vol. 132(C), pages 208-220.
    7. Silvia Chiusano & Tania Cerquitelli & Robert Wrembel & Daniele Quercia, 2021. "Breakthroughs on Cross-Cutting Data Management, Data Analytics, and Applied Data Science," Information Systems Frontiers, Springer, vol. 23(1), pages 1-7, February.
    8. Carlos Ferreira & Alessandro Merendino & Maureen Meadows, 2023. "Disruption and Legitimacy: Big Data in Society," Information Systems Frontiers, Springer, vol. 25(3), pages 1081-1100, June.
    9. Doruk Şen & Cem Çağrı Dönmez & Umman Mahir Yıldırım, 0. "A Hybrid Bi-level Metaheuristic for Credit Scoring," Information Systems Frontiers, Springer, vol. 0, pages 1-11.
    10. Luvai Motiwalla & Amit V. Deokar & Surendra Sarnikar & Angelika Dimoka, 2019. "Leveraging Data Analytics for Behavioral Research," Information Systems Frontiers, Springer, vol. 21(4), pages 735-742, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    2. Cedric Mbanga & Ali F. Darrat & Jung Chul Park, 2019. "Investor sentiment and aggregate stock returns: the role of investor attention," Review of Quantitative Finance and Accounting, Springer, vol. 53(2), pages 397-428, August.
    3. M. Hubert & P. Rousseeuw & K. Vakili, 2014. "Shape bias of robust covariance estimators: an empirical study," Statistical Papers, Springer, vol. 55(1), pages 15-28, February.
    4. Alathur, Sreejith & Vigneswara Ilavarasan, P. & Gupta, M.P., 2016. "Determinants of e-participation in the citizens and the government initiatives: Insights from India," Socio-Economic Planning Sciences, Elsevier, vol. 55(C), pages 25-35.
    5. Hongying Dai & Brian R. Lee & Jianqiang Hao, 2017. "Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys," The ANNALS of the American Academy of Political and Social Science, , vol. 669(1), pages 75-92, January.
    6. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    7. Silvana Rossy Brito & Aleksandra Socorro da Silva & Eulália Carvalho Mata & Nandamudi Lankalapalli Vijaykumar & Cláudio Alex Jorge Rocha & Maurílio Abreu Monteiro & João Crisóstomo Weyl Albuquerque Co, 2018. "An approach to evaluate large-scale ICT training interventions," Information Systems Frontiers, Springer, vol. 20(4), pages 883-899, August.
    8. Jose L Herrera & Ravi Srinivasan & John S Brownstein & Alison P Galvani & Lauren Ancel Meyers, 2016. "Disease Surveillance on Complex Social Networks," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-16, July.
    9. Ibrahim Musa & Hyun Woo Park & Lkhagvadorj Munkhdalai & Keun Ho Ryu, 2018. "Global Research on Syndromic Surveillance from 1993 to 2017: Bibliometric Analysis and Visualization," Sustainability, MDPI, vol. 10(10), pages 1-20, September.
    10. Sarkar, Avijit & Pick, James B. & Johnson, Jeremy, 2015. "Africa's digital divide: Geography, policy, and implications," 2015 Regional ITS Conference, Los Angeles 2015 146339, International Telecommunications Society (ITS).
    11. Hilbert, Martin, 2016. "The bad news is that the digital access divide is here to stay: Domestically installed bandwidths among 172 countries for 1986–2014," Telecommunications Policy, Elsevier, vol. 40(6), pages 567-581.
    12. Valentina Lorenzoni & Gianni Andreozzi & Andrea Bazzani & Virginia Casigliani & Salvatore Pirri & Lara Tavoschi & Giuseppe Turchetti, 2022. "How Italy Tweeted about COVID-19: Detecting Reactions to the Pandemic from Social Media," IJERPH, MDPI, vol. 19(13), pages 1-14, June.
    13. Jingwei Li & Choon-Ling Sia & Zhuo Chen & Wei Huang, 2021. "Enhancing Influenza Epidemics Forecasting Accuracy in China with Both Official and Unofficial Online News Articles, 2019–2020," IJERPH, MDPI, vol. 18(12), pages 1-13, June.
    14. Ilaria Bordino & Stefano Battiston & Guido Caldarelli & Matthieu Cristelli & Antti Ukkonen & Ingmar Weber, 2012. "Web Search Queries Can Predict Stock Market Volumes," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-17, July.
    15. Zhengming Xing & Bradley Nicholson & Monica Jimenez & Timothy Veldman & Lori Hudson & Joseph Lucas & David Dunson & Aimee K. Zaas & Christopher W. Woods & Geoffrey S. Ginsburg & Lawrence Carin, 2014. "Bayesian modeling of temporal properties of infectious disease in a college student population," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(6), pages 1358-1382, June.
    16. Qiong Jia & Yue Guo & Guanlin Wang & Stuart J. Barnes, 2020. "Big Data Analytics in the Fight against Major Public Health Incidents (Including COVID-19): A Conceptual Framework," IJERPH, MDPI, vol. 17(17), pages 1-21, August.
    17. Wang, Di & Zhou, Tao & Lan, Feng & Wang, Mengmeng, 2021. "ICT and socio-economic development: Evidence from a spatial panel data analysis in China," Telecommunications Policy, Elsevier, vol. 45(7).
    18. Serguei Saavedra & Jordi Duch & Brian Uzzi, 2011. "Tracking Traders' Understanding of the Market Using e-Communication Data," PLOS ONE, Public Library of Science, vol. 6(10), pages 1-7, October.
    19. Ali, Mohammad Afshar & Alam, Khorshed & Taylor, Brad, 2020. "Measuring the concentration of information and communication technology infrastructure in Australia: Do affordability and remoteness matter?," Socio-Economic Planning Sciences, Elsevier, vol. 70(C).
    20. Véronique Flambard & Nicolas Vaillant & François-Charles Wolff, 2011. "Dating as Leisure," Chapters, in: Samuel Cameron (ed.), Handbook on the Economics of Leisure, chapter 9, Edward Elgar Publishing.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:21:y:2019:i:4:d:10.1007_s10796-018-9893-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.