IDEAS home Printed from https://ideas.repec.org/a/plo/pgph00/0000495.html
   My bibliography  Save this article

Accuracy comparison of ARIMA and XGBoost forecasting models in predicting the incidence of COVID-19 in Bangladesh

Author

Listed:
  • Md Siddikur Rahman
  • Arman Hossain Chowdhury
  • Miftahuzzannat Amrin

Abstract

Accurate predictive time series modelling is important in public health planning and response during the emergence of a novel pandemic. Therefore, the aims of the study are three-fold: (a) to model the overall trend of COVID-19 confirmed cases and deaths in Bangladesh; (b) to generate a short-term forecast of 8 weeks of COVID-19 cases and deaths; (c) to compare the predictive accuracy of the Autoregressive Integrated Moving Average (ARIMA) and eXtreme Gradient Boosting (XGBoost) for precise modelling of non-linear features and seasonal trends of the time series. The data were collected from the onset of the epidemic in Bangladesh from the Directorate General of Health Service (DGHS) and Institute of Epidemiology, Disease Control and Research (IEDCR). The daily confirmed cases and deaths of COVID-19 of 633 days in Bangladesh were divided into several training and test sets. The ARIMA and XGBoost models were established using those training data, and the test sets were used to evaluate each model’s ability to forecast and finally averaged all the predictive performances to choose the best model. The predictive accuracy of the models was assessed using the mean absolute error (MAE), mean percentage error (MPE), root mean square error (RMSE) and mean absolute percentage error (MAPE). The findings reveal the existence of a nonlinear trend and weekly seasonality in the dataset. The average error measures of the ARIMA model for both COVID-19 confirmed cases and deaths were lower than XGBoost model. Hence, in our study, the ARIMA model performed better than the XGBoost model in predicting COVID-19 confirmed cases and deaths in Bangladesh. The suggested prediction model might play a critical role in estimating the spread of a novel pandemic in Bangladesh and similar countries.

Suggested Citation

  • Md Siddikur Rahman & Arman Hossain Chowdhury & Miftahuzzannat Amrin, 2022. "Accuracy comparison of ARIMA and XGBoost forecasting models in predicting the incidence of COVID-19 in Bangladesh," PLOS Global Public Health, Public Library of Science, vol. 2(5), pages 1-13, May.
  • Handle: RePEc:plo:pgph00:0000495
    DOI: 10.1371/journal.pgph.0000495
    as

    Download full text from publisher

    File URL: https://journals.plos.org/globalpublichealth/article?id=10.1371/journal.pgph.0000495
    Download Restriction: no

    File URL: https://journals.plos.org/globalpublichealth/article/file?id=10.1371/journal.pgph.0000495&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgph.0000495?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Md Siddikur Rahman & Arman Hossain Chowdhury, 2022. "A data-driven eXtreme gradient boosting machine learning model to predict COVID-19 transmission with meteorological drivers," PLOS ONE, Public Library of Science, vol. 17(9), pages 1-14, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgph00:0000495. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: globalpubhealth (email available below). General contact details of provider: https://journals.plos.org/globalpublichealth .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.