Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model

Author

Listed:
  • Gregory L Watson
  • Di Xiong
  • Lu Zhang
  • Joseph A Zoller
  • John Shamshoian
  • Phillip Sundin
  • Teresa Bufford
  • Anne W Rimoin
  • Marc A Suchard
  • Christina M Ramirez

Abstract

Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.

Author summary: COVID-19 models can be roughly classified as mathematical models that simulate disease within a population, including epidemiological compartmental models, or statistical curve-fitting models that fit a function to observed data and extrapolate forward into the future. Bridging this divide, we combine the strengths of curve-fitting statistical models and the structure of epidemiological models, by embedding a Bayesian velocity model and a machine learning algorithm (random forest) into the framework of a compartmental model. Fusing these models together exploits the particular strengths of each to glean as much information as possible from the currently available data. We identify the velocity of log cumulative cases as an excellent target for modeling and extrapolating COVID-19 case trajectories. We empirically evaluate the predictive performance of the model and provide predicted trajectories with credible intervals for cumulative confirmed case count, active confirmed infections and COVID-19 deaths for each of the 50 U.S. states. Combining sophisticated data analytic methods with proven epidemiological models offers an empirically grounded strategy for making realistic predictions and quantifying their uncertainty. These predictions indicate substantial variation in the COVID-19 trajectories of U.S. states.
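
As a rough illustration of two ideas named in the abstract, the sketch below computes the velocity of log cumulative cases as a daily first difference, inverts it to recover the case curve, and enumerates training windows of increasing length, each followed by a 21-day forecast period. This is not the authors' implementation: the function names, the toy case counts, and the `min_train` and `step` values are hypothetical, and the discrete difference stands in for the derivative. Only the 21-day horizon comes from the abstract.

```python
import math
from itertools import accumulate


def log_velocity(cumulative_cases):
    """First difference of log cumulative cases (a discrete 'velocity').

    Assumes every count is strictly positive so the logarithm is defined.
    """
    logs = [math.log(c) for c in cumulative_cases]
    return [later - earlier for earlier, later in zip(logs, logs[1:])]


def cases_from_velocity(initial_cases, velocity):
    """Invert log_velocity: sum the velocities, then exponentiate."""
    log_start = math.log(initial_cases)
    return [initial_cases] + [
        math.exp(log_start + total) for total in accumulate(velocity)
    ]


def rolling_origin_splits(n_days, min_train, horizon=21, step=7):
    """Yield (train_end, test_end) day indices for growing training windows.

    Training uses days [0, train_end); the forecast is scored on
    days [train_end, test_end). min_train and step are illustrative choices.
    """
    for train_end in range(min_train, n_days - horizon + 1, step):
        yield train_end, train_end + horizon


# Hypothetical cumulative confirmed cases for one location (not real data).
cumulative = [100, 150, 210, 273, 328, 377, 415]

velocity = log_velocity(cumulative)
rebuilt = cases_from_velocity(cumulative[0], velocity)
splits = list(rolling_origin_splits(n_days=60, min_train=30))

print([round(v, 3) for v in velocity])
print([round(c) for c in rebuilt])
print(splits)
```

In this toy series the velocity falls steadily toward zero while the cumulative count keeps rising, which hints at why a decaying velocity curve can be a convenient target for extrapolation: a forecast of future velocities converts directly into a forecast of cumulative cases through `cases_from_velocity`.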

Suggested Citation

  • Gregory L Watson & Di Xiong & Lu Zhang & Joseph A Zoller & John Shamshoian & Phillip Sundin & Teresa Bufford & Anne W Rimoin & Marc A Suchard & Christina M Ramirez, 2021. "Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-20, March.
  • Handle: RePEc:plo:pcbi00:1008837
    DOI: 10.1371/journal.pcbi.1008837

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008837
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008837&type=printable
    Download Restriction: no


    Citations

    Cited by:

    1. Roberto Vega & Leonardo Flores & Russell Greiner, 2022. "SIMLR: Machine Learning inside the SIR Model for COVID-19 Forecasting," Forecasting, MDPI, vol. 4(1), pages 1-23, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pongou, Roland & Tchuente, Guy & Tondji, Jean-Baptiste, 2021. "Optimally Targeting Interventions in Networks during a Pandemic: Theory and Evidence from the Networks of Nursing Homes in the United States," GLO Discussion Paper Series 957, Global Labor Organization (GLO).
    2. Nathan H. Schumaker & Sydney M. Watkins, 2021. "Adding Space to Disease Models: A Case Study with COVID-19 in Oregon, USA," Land, MDPI, vol. 10(4), pages 1-13, April.
    3. Roland Pongou & Guy Tchuente & Jean-Baptiste Tondji, 2021. "Optimally Targeting Interventions in Networks during a Pandemic: Theory and Evidence from the Networks of Nursing Homes in the United States," Papers 2110.10230, arXiv.org.
    4. Waychal, Nachiketas & Laha, Arnab Kumar & Sinha, Ankur, 2022. "Customized forecasting with Adaptive Ensemble Generator," IIMA Working Papers WP 2022-06-04, Indian Institute of Management Ahmedabad, Research and Publication Department.
    5. Ichino, Andrea & Favero, Carlo A. & Rustichini, Aldo, 2020. "Restarting the economy while saving lives under Covid-19," CEPR Discussion Papers 14664, C.E.P.R. Discussion Papers.
    6. Brodeur, Abel & Clark, Andrew E. & Fleche, Sarah & Powdthavee, Nattavudh, 2021. "COVID-19, lockdowns and well-being: Evidence from Google Trends," Journal of Public Economics, Elsevier, vol. 193(C).
    7. Charles A.E. Goodhart & Dimitrios P. Tsomocos & Xuan Wang, 2023. "Support for small businesses amid COVID‐19," Economica, London School of Economics and Political Science, vol. 90(358), pages 612-652, April.
    8. David Baqaee & Emmanuel Farhi, 2020. "Nonlinear Production Networks with an Application to the Covid-19 Crisis," NBER Working Papers 27281, National Bureau of Economic Research, Inc.
    9. Graham, James & Ozbilgin, Murat, 2021. "Age, industry, and unemployment risk during a pandemic lockdown," Journal of Economic Dynamics and Control, Elsevier, vol. 133(C).
    10. Louis-Philippe Beland & Abel Brodeur & Taylor Wright, 2020. "COVID-19, Stay-at-Home Orders and Employment: Evidence from CPS Data," Carleton Economic Papers 20-04, Carleton University, Department of Economics, revised 19 May 2020.
    11. Houštecká, Anna & Koh, Dongya & Santaeulàlia-Llopis, Raül, 2021. "Contagion at work: Occupations, industries and human contact," Journal of Public Economics, Elsevier, vol. 200(C).
    12. Xiao Chen & Hanwei Huang & Jiandong Ju & Ruoyan Sun & Jialiang Zhang, 2022. "Endogenous cross-region human mobility and pandemics," CEP Discussion Papers dp1860, Centre for Economic Performance, LSE.
    13. Shami, Labib & Lazebnik, Teddy, 2022. "Economic aspects of the detection of new strains in a multi-strain epidemiological–mathematical model," Chaos, Solitons & Fractals, Elsevier, vol. 165(P2).
    14. Gopal K. Basak & Chandramauli Chakraborty & Pranab Kumar Das, 2021. "Optimal Lockdown Strategy in a Pandemic: An Exploratory Analysis for Covid-19," Papers 2109.02512, arXiv.org.
    15. Laura Alfaro & Anusha Chari & Andrew N. Greenland & Peter K. Schott, 2020. "Aggregate and Firm-Level Stock Returns During Pandemics, in Real Time," NBER Working Papers 26950, National Bureau of Economic Research, Inc.
    16. Chen, Simiao & Jin, Zhangfeng & Bloom, David E., 2020. "Act Early to Prevent Infections and Save Lives: Causal Impact of Diagnostic Efficiency on the COVID-19 Pandemic," IZA Discussion Papers 13749, Institute of Labor Economics (IZA).
    17. Hortaçsu, Ali & Liu, Jiarui & Schwieg, Timothy, 2021. "Estimating the fraction of unreported infections in epidemics with a known epicenter: An application to COVID-19," Journal of Econometrics, Elsevier, vol. 220(1), pages 106-129.
    18. Joshua Bernstein & Alexander W. Richter & Nathaniel A. Throckmorton, 2020. "COVID-19: A View from the Labor Market," Working Papers 2010, Federal Reserve Bank of Dallas.
    19. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    20. Daron Acemoglu & Victor Chernozhukov & Iván Werning & Michael D. Whinston, 2021. "Optimal Targeted Lockdowns in a Multigroup SIR Model," American Economic Review: Insights, American Economic Association, vol. 3(4), pages 487-502, December.
