IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1013709.html

Real-time forecasting of data revisions in epidemic surveillance streams

Author

Listed:
  • Jingjing Tang
  • Aaron Rumack
  • Bryan Wilder
  • Roni Rosenfeld

Abstract

Epidemic data streams undergo frequent revisions due to reporting delays (“backfill”) and other factors. Relying on tentative surveillance values can seriously degrade the quality of situational awareness, forecasting accuracy and decision-making. We introduce Delphi Revision Forecast (Delphi-RF), a real-time data revision forecasting framework using nonparametric quantile regression, applicable to both counts and proportions (fractions) in public health reporting. By incorporating all available revisions up to a given estimation date, Delphi-RF models revision dynamics and generates distributional forecasts of finalized surveillance values. Applied to daily COVID-19 data (insurance claims, antigen tests, confirmed cases) and weekly dengue and influenza-like illness (ILI) case counts, Delphi-RF delivers accurate revision forecasts, particularly in early reporting stages. In addition, it improves computational efficiency by more than 10-100x compared to existing methods, making it a scalable solution for real-time public health surveillance.Author summary: Accurate and reliable forecasts of infectious disease epidemics, such as COVID-19, are essential but challenging. The presence of data revisions in public health data streams can introduce significant biases in both predictors and responses, leading to suboptimal situational awareness, preparedness, and downstream countermeasure design. To address this issue, we propose a modeling framework that leverages historical revision patterns to generate distributional forecasts of finalized surveillance values. Applicable to both count-type and fraction-type data across various temporal resolutions and epidemic surveillance data streams, our approach ensures real-time accuracy, even with only early revisions available. Moreover, our method achieves competitive or superior forecast accuracy compared to existing methods, while also demonstrating a more than 10-100x improvement in computational efficiency.

Suggested Citation

  • Jingjing Tang & Aaron Rumack & Bryan Wilder & Roni Rosenfeld, 2025. "Real-time forecasting of data revisions in epidemic surveillance streams," PLOS Computational Biology, Public Library of Science, vol. 21(11), pages 1-24, November.
  • Handle: RePEc:plo:pcbi00:1013709
    DOI: 10.1371/journal.pcbi.1013709
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013709
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1013709&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1013709?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Croushore, Dean, 2006. "Forecasting with Real-Time Macroeconomic Data," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 17, pages 961-982, Elsevier.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Clements Michael P., 2012. "Forecasting U.S. Output Growth with Non-Linear Models in the Presence of Data Uncertainty," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 16(1), pages 1-27, January.
    2. Andrea Carriero & Todd E. Clark & Massimiliano Marcellino, 2016. "Common Drifting Volatility in Large Bayesian VARs," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 375-390, July.
    3. Nikoleta Anesti & Ana Beatriz Galvao & Silvia Miranda-Agrippino, 2018. "Uncertain Kingdom: Nowcasting GDP and its Revisions," Discussion Papers 1824, Centre for Macroeconomics (CFM).
    4. Davide Delle Monache & Andrea De Polis & Ivan Petrella, 2024. "Modeling and Forecasting Macroeconomic Downside Risk," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 1010-1025, July.
    5. Michael Pfarrhofer, 2024. "Forecasts with Bayesian vector autoregressions under real time conditions," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(3), pages 771-801, April.
    6. Massimiliano Marcellino, 2008. "A linear benchmark for forecasting GDP growth and inflation?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 27(4), pages 305-340.
    7. Kohns, David & Bhattacharjee, Arnab, 2023. "Nowcasting growth using Google Trends data: A Bayesian Structural Time Series model," International Journal of Forecasting, Elsevier, vol. 39(3), pages 1384-1412.
    8. Andrew Patton & Allan Timmermann, 2012. "Forecast Rationality Tests Based on Multi-Horizon Bounds," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(1), pages 1-17.
    9. Clark, Todd & McCracken, Michael, 2013. "Advances in Forecast Evaluation," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1107-1201, Elsevier.
    10. Michael Funke & Aaron Mehrotra & Hao Yu, 2015. "Tracking Chinese CPI inflation in real time," Empirical Economics, Springer, vol. 48(4), pages 1619-1641, June.
    11. Feng Zhao & Guofu Zhou & Xiaoneng Zhu, 2021. "Unspanned Global Macro Risks in Bond Returns," Management Science, INFORMS, vol. 67(12), pages 7825-7843, December.
    12. Krüger, Fabian & Nolte, Ingmar, 2016. "Disagreement versus uncertainty: Evidence from distribution forecasts," Journal of Banking & Finance, Elsevier, vol. 72(S), pages 172-186.
    13. Inske Pirschel & Maik H. Wolters, 2018. "Forecasting with large datasets: compressing information before, during or after the estimation?," Empirical Economics, Springer, vol. 55(2), pages 573-596, September.
    14. Tuhkuri, Joonas, 2016. "Forecasting Unemployment with Google Searches," ETLA Working Papers 35, The Research Institute of the Finnish Economy.
    15. van den Hauwe, Sjoerd & Paap, Richard & van Dijk, Dick, 2013. "Bayesian forecasting of federal funds target rate decisions," Journal of Macroeconomics, Elsevier, vol. 37(C), pages 19-40.
    16. Emilia Tomczyk, 2013. "End of sample vs. real time data: perspectives for analysis of expectations," Working Papers 68, Department of Applied Econometrics, Warsaw School of Economics.
    17. Todd E. Clark & Michael W. McCracken, 2010. "Averaging forecasts from VARs with uncertain instabilities," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 25(1), pages 5-29, January.
    18. S. Boragan Aruoba & Francis X. Diebold, 2010. "Real-Time Macroeconomic Monitoring: Real Activity, Inflation, and Interactions," American Economic Review, American Economic Association, vol. 100(2), pages 20-24, May.
    19. Andrea Carriero & Todd E. Clark & Marcellino Massimiliano, 2020. "Nowcasting Tail Risks to Economic Activity with Many Indicators," Working Papers 20-13R2, Federal Reserve Bank of Cleveland, revised 22 Sep 2020.
    20. Keijsers, Bart & van Dijk, Dick, 2025. "Does economic uncertainty predict real activity in real time?," International Journal of Forecasting, Elsevier, vol. 41(2), pages 748-762.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1013709. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.