IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008837.html
   My bibliography  Save this article

Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

Author

Listed:
  • Gregory L Watson
  • Di Xiong
  • Lu Zhang
  • Joseph A Zoller
  • John Shamshoian
  • Phillip Sundin
  • Teresa Bufford
  • Anne W Rimoin
  • Marc A Suchard
  • Christina M Ramirez

Abstract

Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.Author summary: COVID-19 models can be roughly classified as mathematical models that simulate disease within a population, including epidemiological compartmental models, or statistical curve-fitting models that fit a function to observed data and extrapolate forward into the future. Bridging this divide, we combine the strengths of curve-fitting statistical models and the structure of epidemiological models, by embedding a Bayesian velocity model and a machine learning algorithm (random forest) into the framework of a compartmental model. Fusing these models together exploits the particular strengths of each to glean as much information as possible from the currently available data. We identify the velocity of log cumulative cases as an excellent target for modeling and extrapolating COVID-19 case trajectories. We empirically evaluate the predictive performance of the model and provide predicted trajectories with credible intervals for cumulative confirmed case count, active confirmed infections and COVID-19 deaths for each of the 50 U.S. states. Combining sophisticated data analytic methods with proven epidemiological models offers an empirically grounded strategy for making realistic predictions and quantifying their uncertainty. These predictions indicate substantial variation in the COVID-19 trajectories of U.S. states.

Suggested Citation

  • Gregory L Watson & Di Xiong & Lu Zhang & Joseph A Zoller & John Shamshoian & Phillip Sundin & Teresa Bufford & Anne W Rimoin & Marc A Suchard & Christina M Ramirez, 2021. "Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-20, March.
  • Handle: RePEc:plo:pcbi00:1008837
    DOI: 10.1371/journal.pcbi.1008837
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008837
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008837&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008837?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David Berger & Kyle Herkenhoff & Chengdai Huang & Simon Mongey, 2022. "Testing and Reopening in an SEIR Model," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 43, pages 1-21, January.
    2. Zhang, Xiaolei & Ma, Renjun & Wang, Lin, 2020. "Predicting turning point, duration and attack rate of COVID-19 outbreaks in major Western countries," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    3. Roman Wölfel & Victor M. Corman & Wolfgang Guggemos & Michael Seilmaier & Sabine Zange & Marcel A. Müller & Daniela Niemeyer & Terry C. Jones & Patrick Vollmar & Camilla Rothe & Michael Hoelscher & To, 2020. "Virological assessment of hospitalized patients with COVID-2019," Nature, Nature, vol. 581(7809), pages 465-469, May.
    4. Ndaïrou, Faïçal & Area, Iván & Nieto, Juan J. & Torres, Delfim F.M., 2020. "Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan," Chaos, Solitons & Fractals, Elsevier, vol. 135(C).
    5. Soetaert, Karline & Petzoldt, Thomas & Setzer, R. Woodrow, 2010. "Solving Differential Equations in R: Package deSolve," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i09).
    6. Cleo Anastassopoulou & Lucia Russo & Athanasios Tsakris & Constantinos Siettos, 2020. "Data-based analysis, modelling and forecasting of the COVID-19 outbreak," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-21, March.
    7. Fotios Petropoulos & Spyros Makridakis, 2020. "Forecasting the novel coronavirus COVID-19," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-8, March.
    8. Sheryl L. Chang & Nathan Harding & Cameron Zachreson & Oliver M. Cliff & Mikhail Prokopenko, 2020. "Modelling transmission and control of the COVID-19 pandemic in Australia," Nature Communications, Nature, vol. 11(1), pages 1-13, December.
    9. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    10. Sepehr Rafieenasab & Amir-Pouyan Zahiri & Ehsan Roohi, 2020. "Prediction of peak and termination of novel coronavirus COVID-19 epidemic in Iran," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 31(11), pages 1-17, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yang Ye & Abhishek Pandey & Carolyn Bawden & Dewan Md. Sumsuzzman & Rimpi Rajput & Affan Shoukat & Burton H. Singer & Seyed M. Moghadas & Alison P. Galvani, 2025. "Integrating artificial intelligence with mechanistic epidemiological modeling: a scoping review of opportunities and challenges," Nature Communications, Nature, vol. 16(1), pages 1-18, December.
    2. Gerardo Chowell & Sushma Dahal & Amna Tariq & Kimberlyn Roosa & James M Hyman & Ruiyan Luo, 2022. "An ensemble n-sub-epidemic modeling framework for short-term forecasting epidemic trajectories: Application to the COVID-19 pandemic in the USA," PLOS Computational Biology, Public Library of Science, vol. 18(10), pages 1-20, October.
    3. Roberto Vega & Leonardo Flores & Russell Greiner, 2022. "SIMLR: Machine Learning inside the SIR Model for COVID-19 Forecasting," Forecasting, MDPI, vol. 4(1), pages 1-23, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nathan H. Schumaker & Sydney M. Watkins, 2021. "Adding Space to Disease Models: A Case Study with COVID-19 in Oregon, USA," Land, MDPI, vol. 10(4), pages 1-13, April.
    2. Waychal, Nachiketas & Laha, Arnab Kumar & Sinha, Ankur, 2022. "Customized forecasting with Adaptive Ensemble Generator," IIMA Working Papers WP 2022-06-04, Indian Institute of Management Ahmedabad, Research and Publication Department.
    3. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    4. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "Dynamic tracking with model-based forecasting for the spread of the COVID-19 pandemic," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    5. Rabih Ghostine & Mohamad Gharamti & Sally Hassrouny & Ibrahim Hoteit, 2021. "Mathematical Modeling of Immune Responses against SARS-CoV-2 Using an Ensemble Kalman Filter," Mathematics, MDPI, vol. 9(19), pages 1-13, September.
    6. Semenoglou, Artemios-Anargyros & Spiliotis, Evangelos & Makridakis, Spyros & Assimakopoulos, Vassilios, 2021. "Investigating the accuracy of cross-learning time series forecasting methods," International Journal of Forecasting, Elsevier, vol. 37(3), pages 1072-1084.
    7. Dalton Garcia Borges de Souza & Erivelton Antonio dos Santos & Francisco Tarcísio Alves Júnior & Mariá Cristina Vasconcelos Nascimento, 2021. "On Comparing Cross-Validated Forecasting Models with a Novel Fuzzy-TOPSIS Metric: A COVID-19 Case Study," Sustainability, MDPI, vol. 13(24), pages 1-25, December.
    8. Konstantinos Demertzis & Dimitrios Tsiotas & Lykourgos Magafas, 2020. "Modeling and Forecasting the COVID-19 Temporal Spread in Greece: An Exploratory Approach Based on Complex Network Defined Splines," IJERPH, MDPI, vol. 17(13), pages 1-17, June.
    9. Sinitsyn, E. V. & Tolmachev, A. V. & Ovchinnikov, A. S., 2020. "Socio-economic factors in the spread of SARS-COV-2 across Russian regions," R-Economy, Ural Federal University, Graduate School of Economics and Management, vol. 6(3), pages 129-145.
    10. Pelinovsky, Efim & Kurkin, Andrey & Kurkina, Oxana & Kokoulina, Maria & Epifanova, Anastasia, 2020. "Logistic equation and COVID-19," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    11. Michał Wieczorek & Jakub Siłka & Dawid Połap & Marcin Woźniak & Robertas Damaševičius, 2020. "Real-time neural network based predictor for cov19 virus spread," PLOS ONE, Public Library of Science, vol. 15(12), pages 1-18, December.
    12. Jordan J Bird & Chloe M Barnes & Cristiano Premebida & Anikó Ekárt & Diego R Faria, 2020. "Country-level pandemic risk and preparedness classification based on COVID-19 data: A machine learning approach," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-20, October.
    13. da Silva, Ramon Gomes & Ribeiro, Matheus Henrique Dal Molin & Mariani, Viviana Cocco & Coelho, Leandro dos Santos, 2020. "Forecasting Brazilian and American COVID-19 cases based on artificial intelligence coupled with climatic exogenous variables," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    14. Tsiligianni, Christiana & Tsiligiannis, Aristeides & Tsiliyannis, Christos, 2023. "A stochastic inventory model of COVID-19 and robust, real-time identification of carriers at large and infection rate via asymptotic laws," European Journal of Operational Research, Elsevier, vol. 304(1), pages 42-56.
    15. Roland Pongou & Guy Tchuente & Jean-Baptiste Tondji, 2021. "Optimally Targeting Interventions in Networks during a Pandemic: Theory and Evidence from the Networks of Nursing Homes in the United States," Papers 2110.10230, arXiv.org.
    16. Jiří Mazurek, 2021. "The evaluation of COVID-19 prediction precision with a Lyapunov-like exponent," PLOS ONE, Public Library of Science, vol. 16(5), pages 1-9, May.
    17. Giacomo De Nicola & Marc Schneble & Göran Kauermann & Ursula Berger, 2022. "Regional now- and forecasting for data reported with delay: toward surveillance of COVID-19 infections," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(3), pages 407-426, September.
    18. Aman Khakharia & Vruddhi Shah & Sankalp Jain & Jash Shah & Amanshu Tiwari & Prathamesh Daphal & Mahesh Warang & Ninad Mehendale, 2021. "Outbreak Prediction of COVID-19 for Dense and Populated Countries Using Machine Learning," Annals of Data Science, Springer, vol. 8(1), pages 1-19, March.
    19. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "A SIR model assumption for the spread of COVID-19 in different communities," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    20. Memon, Zaibunnisa & Qureshi, Sania & Memon, Bisharat Rasool, 2021. "Assessing the role of quarantine and isolation as control strategies for COVID-19 outbreak: A case study," Chaos, Solitons & Fractals, Elsevier, vol. 144(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008837. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.