IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1007518.html
   My bibliography  Save this article

Forecasting dengue and influenza incidences using a sparse representation of Google trends, electronic health records, and time series data

Author

Listed:
  • Prashant Rangarajan
  • Sandeep K Mody
  • Madhav Marathe

Abstract

Dengue and influenza-like illness (ILI) are two of the leading causes of viral infection in the world and it is estimated that more than half the world’s population is at risk for developing these infections. It is therefore important to develop accurate methods for forecasting dengue and ILI incidences. Since data from multiple sources (such as dengue and ILI case counts, electronic health records and frequency of multiple internet search terms from Google Trends) can improve forecasts, standard time series analysis methods are inadequate to estimate all the parameter values from the limited amount of data available if we use multiple sources. In this paper, we use a computationally efficient implementation of the known variable selection method that we call the Autoregressive Likelihood Ratio (ARLR) method. This method combines sparse representation of time series data, electronic health records data (for ILI) and Google Trends data to forecast dengue and ILI incidences. This sparse representation method uses an algorithm that maximizes an appropriate likelihood ratio at every step. Using numerical experiments, we demonstrate that our method recovers the underlying sparse model much more accurately than the lasso method. We apply our method to dengue case count data from five countries/states: Brazil, Mexico, Singapore, Taiwan, and Thailand and to ILI case count data from the United States. Numerical experiments show that our method outperforms existing time series forecasting methods in forecasting the dengue and ILI case counts. In particular, our method gives a 18 percent forecast error reduction over a leading method that also uses data from multiple sources. It also performs better than other methods in predicting the peak value of the case count and the peak time.Author summary: Dengue and influenza-like illness (ILI) are leading causes of viral infection in the world and hence it is important to develop accurate methods for forecasting their incidence. We use Autoregressive Likelihood Ratio method, which is a computationally efficient implementation of the variable selection method, in order to obtain a sparse (non-lasso) representation of time series, Google Trends and electronic health records (for ILI) data. This method is used to forecast dengue incidence in five countries/states and ILI incidence in USA. We show that this method outperforms existing time series methods in forecasting these diseases. The method is general and can also be used to forecast other diseases.

Suggested Citation

  • Prashant Rangarajan & Sandeep K Mody & Madhav Marathe, 2019. "Forecasting dengue and influenza incidences using a sparse representation of Google trends, electronic health records, and time series data," PLOS Computational Biology, Public Library of Science, vol. 15(11), pages 1-24, November.
  • Handle: RePEc:plo:pcbi00:1007518
    DOI: 10.1371/journal.pcbi.1007518
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007518
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1007518&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1007518?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Evan L Ray & Nicholas G Reich, 2018. "Prediction of infectious disease epidemics via weighted density ensembles," PLOS Computational Biology, Public Library of Science, vol. 14(2), pages 1-23, February.
    2. Helmut Lütkepohl, 2005. "New Introduction to Multiple Time Series Analysis," Springer Books, Springer, number 978-3-540-27752-1, January.
    3. Samir Bhatt & Peter W. Gething & Oliver J. Brady & Jane P. Messina & Andrew W. Farlow & Catherine L. Moyes & John M. Drake & John S. Brownstein & Anne G. Hoen & Osman Sankoh & Monica F. Myers & Dylan , 2013. "The global distribution and burden of dengue," Nature, Nature, vol. 496(7446), pages 504-507, April.
    4. Aditya Lia Ramadona & Lutfan Lazuardi & Yien Ling Hii & Åsa Holmner & Hari Kusnanto & Joacim Rocklöv, 2016. "Prediction of Dengue Outbreaks Based on Disease Surveillance and Meteorological Data," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-18, March.
    5. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    6. Anna L Buczak & Benjamin Baugher & Linda J Moniz & Thomas Bagley & Steven M Babin & Erhan Guven, 2018. "Ensemble method for dengue prediction," PLOS ONE, Public Library of Science, vol. 13(1), pages 1-23, January.
    7. Prithwish Chakraborty & Bryan Lewis & Stephen Eubank & John S Brownstein & Madhav Marathe & Naren Ramakrishnan, 2018. "What to know before forecasting the flu," PLOS Computational Biology, Public Library of Science, vol. 14(10), pages 1-7, October.
    8. Teresa K Yamana & Sasikiran Kandula & Jeffrey Shaman, 2017. "Individual versus superensemble forecasts of seasonal influenza outbreaks in the United States," PLOS Computational Biology, Public Library of Science, vol. 13(11), pages 1-17, November.
    9. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2018. "Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-29, June.
    10. Cecilia de Almeida Marques-Toledo & Carolin Marlen Degener & Livia Vinhal & Giovanini Coelho & Wagner Meira & Claudia Torres Codeço & Mauro Martins Teixeira, 2017. "Dengue prediction by the web: Tweets are a useful tool for estimating and forecasting Dengue at country and city level," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 11(7), pages 1-20, July.
    11. Jean-Paul Chretien & Dylan George & Jeffrey Shaman & Rohit A Chitale & F Ellis McKenzie, 2014. "Influenza Forecasting in Human Populations: A Scoping Review," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-8, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Panja, Madhurima & Chakraborty, Tanujit & Nadim, Sk Shahid & Ghosh, Indrajit & Kumar, Uttam & Liu, Nan, 2023. "An ensemble neural network approach to forecast Dengue outbreak based on climatic condition," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nicholas G Reich & Craig J McGowan & Teresa K Yamana & Abhinav Tushar & Evan L Ray & Dave Osthus & Sasikiran Kandula & Logan C Brooks & Willow Crawford-Crudell & Graham Casey Gibson & Evan Moore & Reb, 2019. "Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S," PLOS Computational Biology, Public Library of Science, vol. 15(11), pages 1-19, November.
    2. Michal Ben-Nun & Pete Riley & James Turtle & David P Bacon & Steven Riley, 2019. "Forecasting national and regional influenza-like illness for the USA," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-20, May.
    3. John M Drake & Tobias S Brett & Shiyang Chen & Bogdan I Epureanu & Matthew J Ferrari & Éric Marty & Paige B Miller & Eamon B O’Dea & Suzanne M O’Regan & Andrew W Park & Pejman Rohani, 2019. "The statistics of epidemic transitions," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-14, May.
    4. Fantazzini, Dean, 2020. "Short-term forecasting of the COVID-19 pandemic using Google Trends data: Evidence from 158 countries," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 59, pages 33-54.
    5. Laith Hussain-Alkhateeb & Tatiana Rivera Ramírez & Axel Kroeger & Ernesto Gozzer & Silvia Runge-Ranzinger, 2021. "Early warning systems (EWSs) for chikungunya, dengue, malaria, yellow fever, and Zika outbreaks: What is the evidence? A scoping review," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 15(9), pages 1-25, September.
    6. Zhichao Li, 2022. "Forecasting Weekly Dengue Cases by Integrating Google Earth Engine-Based Risk Predictor Generation and Google Colab-Based Deep Learning Modeling in Fortaleza and the Federal District, Brazil," IJERPH, MDPI, vol. 19(20), pages 1-16, October.
    7. Junyi Lu & Sebastian Meyer, 2020. "Forecasting Flu Activity in the United States: Benchmarking an Endemic-Epidemic Beta Model," IJERPH, MDPI, vol. 17(4), pages 1-13, February.
    8. Zeynep Ertem & Dorrie Raymond & Lauren Ancel Meyers, 2018. "Optimal multi-source forecasting of seasonal influenza," PLOS Computational Biology, Public Library of Science, vol. 14(9), pages 1-16, September.
    9. Bernard Bett & Delia Grace & Hu Suk Lee & Johanna Lindahl & Hung Nguyen-Viet & Pham-Duc Phuc & Nguyen Huu Quyen & Tran Anh Tu & Tran Dac Phu & Dang Quang Tan & Vu Sinh Nam, 2019. "Spatiotemporal analysis of historical records (2001–2012) on dengue fever in Vietnam and development of a statistical model for forecasting risk," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-22, November.
    10. Ray, Evan L. & Brooks, Logan C. & Bien, Jacob & Biggerstaff, Matthew & Bosse, Nikos I. & Bracher, Johannes & Cramer, Estee Y. & Funk, Sebastian & Gerding, Aaron & Johansson, Michael A. & Rumack, Aaron, 2023. "Comparing trained and untrained probabilistic ensemble forecasts of COVID-19 cases and deaths in the United States," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1366-1383.
    11. Panja, Madhurima & Chakraborty, Tanujit & Nadim, Sk Shahid & Ghosh, Indrajit & Kumar, Uttam & Liu, Nan, 2023. "An ensemble neural network approach to forecast Dengue outbreak based on climatic condition," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
    12. Oswaldo Santos Baquero & Lidia Maria Reis Santana & Francisco Chiaravalloti-Neto, 2018. "Dengue forecasting in São Paulo city with generalized additive models, artificial neural networks and seasonal autoregressive integrated moving average models," PLOS ONE, Public Library of Science, vol. 13(4), pages 1-12, April.
    13. Chathurika Hettiarachchige & Stefan von Cavallar & Timothy Lynar & Roslyn I Hickson & Manoj Gambhir, 2018. "Risk prediction system for dengue transmission based on high resolution weather data," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-17, December.
    14. Logan C Brooks & David C Farrow & Sangwon Hyun & Ryan J Tibshirani & Roni Rosenfeld, 2018. "Nonmechanistic forecasts of seasonal influenza with iterative one-week-ahead distributions," PLOS Computational Biology, Public Library of Science, vol. 14(6), pages 1-29, June.
    15. Gerardo Manzo & Antonio Picca, 2020. "The Impact of Sovereign Shocks," Management Science, INFORMS, vol. 66(7), pages 3113-3132, July.
    16. António Afonso & Yasfir Ibraimo, 2020. "The macroeconomic effects of public debt: an empirical analysis of Mozambique," Applied Economics, Taylor & Francis Journals, vol. 52(2), pages 212-226, January.
    17. Evangelos Salachas & Georgios P. Kouretas & Nikiforos T. Laopodis, 2024. "The term structure of interest rates and economic activity: Evidence from the COVID‐19 pandemic," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(4), pages 1018-1041, July.
    18. Ignacio Lozano Espitia & Karen Rodríguez, 2009. "Assessing the Macroeconomic Effects of Fiscal," Borradores de Economia 5386, Banco de la Republica.
    19. Sunil S. Poshakwale & Pankaj Chandorkar, 2019. "The Impact of Aggregate and Disaggregate Consumption Shocks on the Equity Risk Premium in the United Kingdom," Annals of Economics and Finance, Society for AEF, vol. 20(2), pages 489-524, November.
    20. Wang, Yuanyuan & Chi, Yuanying & Xu, Jin-Hua & Yuan, Yongke, 2022. "Consumers’ attitudes and their effects on electric vehicle sales and charging infrastructure construction: An empirical study in China," Energy Policy, Elsevier, vol. 165(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1007518. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.