IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0241332.html
   My bibliography  Save this article

Country-level pandemic risk and preparedness classification based on COVID-19 data: A machine learning approach

Author

Listed:
  • Jordan J Bird
  • Chloe M Barnes
  • Cristiano Premebida
  • Anikó Ekárt
  • Diego R Faria

Abstract

In this work we present a three-stage Machine Learning strategy to country-level risk classification based on countries that are reporting COVID-19 information. A K% binning discretisation (K = 25) is used to create four risk groups of countries based on the risk of transmission (coronavirus cases per million population), risk of mortality (coronavirus deaths per million population), and risk of inability to test (coronavirus tests per million population). The four risk groups produced by K% binning are labelled as ‘low’, ‘medium-low’, ‘medium-high’, and ‘high’. Coronavirus-related data are then removed and the attributes for prediction of the three types of risk are given as the geopolitical and demographic data describing each country. Thus, the calculation of class label is based on coronavirus data but the input attributes are country-level information regardless of coronavirus data. The three four-class classification problems are then explored and benchmarked through leave-one-country-out cross validation to find the strongest model, producing a Stack of Gradient Boosting and Decision Tree algorithms for risk of transmission, a Stack of Support Vector Machine and Extra Trees for risk of mortality, and a Gradient Boosting algorithm for the risk of inability to test. It is noted that high risk for inability to test is often coupled with low risks for transmission and mortality, therefore the risk of inability to test should be interpreted first, before consideration is given to the predicted transmission and mortality risks. Finally, the approach is applied to more recent risk levels to data from September 2020 and weaker results are noted due to the growth of international collaboration detracting useful knowledge from country-level attributes which suggests that similar machine learning approaches are more useful prior to situations later unfolding.

Suggested Citation

  • Jordan J Bird & Chloe M Barnes & Cristiano Premebida & Anikó Ekárt & Diego R Faria, 2020. "Country-level pandemic risk and preparedness classification based on COVID-19 data: A machine learning approach," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-20, October.
  • Handle: RePEc:plo:pone00:0241332
    DOI: 10.1371/journal.pone.0241332
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0241332
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0241332&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0241332?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Cleo Anastassopoulou & Lucia Russo & Athanasios Tsakris & Constantinos Siettos, 2020. "Data-based analysis, modelling and forecasting of the COVID-19 outbreak," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-21, March.
    2. Hortaçsu, Ali & Liu, Jiarui & Schwieg, Timothy, 2021. "Estimating the fraction of unreported infections in epidemics with a known epicenter: An application to COVID-19," Journal of Econometrics, Elsevier, vol. 220(1), pages 106-129.
    3. Fotios Petropoulos & Spyros Makridakis, 2020. "Forecasting the novel coronavirus COVID-19," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-8, March.
    4. Zhiyuan Ma & Ping Wang & Zehui Gao & Ruobing Wang & Koroush Khalighi, 2018. "Ensemble of machine learning algorithms using the stacked generalization approach to estimate the warfarin dose," PLOS ONE, Public Library of Science, vol. 13(10), pages 1-12, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Amalraj Irudayasamy & D. Ganesh & M. Natesh & N. Rajesh & Umi Salma, 2024. "Big data analytics on the impact of OMICRON and its influence on unvaccinated community through advanced machine learning concepts," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 15(1), pages 346-355, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noureddine Ouerfelli & Narcisa Vrinceanu & Diana Coman & Adriana Lavinia Cioca, 2022. "Empirical Modeling of COVID-19 Evolution with High/Direct Impact on Public Health and Risk Assessment," IJERPH, MDPI, vol. 19(6), pages 1-13, March.
    2. Konstantinos Demertzis & Dimitrios Tsiotas & Lykourgos Magafas, 2020. "Modeling and Forecasting the COVID-19 Temporal Spread in Greece: An Exploratory Approach Based on Complex Network Defined Splines," IJERPH, MDPI, vol. 17(13), pages 1-17, June.
    3. Cem Cakmakli & Yasin Simsek, 2020. "Bridging the COVID-19 Data and the Epidemiological Model using Time Varying Parameter SIRD Model," Papers 2007.02726, arXiv.org, revised Feb 2021.
    4. Nathan H. Schumaker & Sydney M. Watkins, 2021. "Adding Space to Disease Models: A Case Study with COVID-19 in Oregon, USA," Land, MDPI, vol. 10(4), pages 1-13, April.
    5. Gregory L Watson & Di Xiong & Lu Zhang & Joseph A Zoller & John Shamshoian & Phillip Sundin & Teresa Bufford & Anne W Rimoin & Marc A Suchard & Christina M Ramirez, 2021. "Pandemic velocity: Forecasting COVID-19 in the US with a machine learning & Bayesian time series compartmental model," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-20, March.
    6. Giacomo De Nicola & Marc Schneble & Göran Kauermann & Ursula Berger, 2022. "Regional now- and forecasting for data reported with delay: toward surveillance of COVID-19 infections," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(3), pages 407-426, September.
    7. Cem Cakmakli & Yasin Simsek, 2023. "Bridging the Covid-19 Data and the Epidemiological Model using Time-Varying Parameter SIRD Model," Papers 2301.13692, arXiv.org.
    8. Vaishnav, Vaibhav & Vajpai, Jayashri, 2020. "Assessment of impact of relaxation in lockdown and forecast of preparation for combating COVID-19 pandemic in India using Group Method of Data Handling," Chaos, Solitons & Fractals, Elsevier, vol. 140(C).
    9. Waychal, Nachiketas & Laha, Arnab Kumar & Sinha, Ankur, 2022. "Customized forecasting with Adaptive Ensemble Generator," IIMA Working Papers WP 2022-06-04, Indian Institute of Management Ahmedabad, Research and Publication Department.
    10. Dante Miller & Jong-Min Kim, 2021. "Univariate and Multivariate Machine Learning Forecasting Models on the Price Returns of Cryptocurrencies," JRFM, MDPI, vol. 14(10), pages 1-10, October.
    11. Masum, Mohammad & Masud, M.A. & Adnan, Muhaiminul Islam & Shahriar, Hossain & Kim, Sangil, 2022. "Comparative study of a mathematical epidemic model, statistical modeling, and deep learning for COVID-19 forecasting and management," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    12. Das, Saikat & Bose, Indranil & Sarkar, Uttam Kumar, 2023. "Predicting the outbreak of epidemics using a network-based approach," European Journal of Operational Research, Elsevier, vol. 309(2), pages 819-831.
    13. František Božek & Irena Tušer, 2021. "Measures for Ensuring Sustainability during the Current Spreading of Coronaviruses in the Czech Republic," Sustainability, MDPI, vol. 13(12), pages 1-22, June.
    14. Khan, Waqas & Walker, Shalika & Zeiler, Wim, 2022. "Improved solar photovoltaic energy generation forecast using deep learning-based ensemble stacking approach," Energy, Elsevier, vol. 240(C).
    15. Chen, Xiaowei & Chong, Wing Fung & Feng, Runhuan & Zhang, Linfeng, 2021. "Pandemic risk management: Resources contingency planning and allocation," Insurance: Mathematics and Economics, Elsevier, vol. 101(PB), pages 359-383.
    16. Centorrino, Samuele & Parmeter, Christopher F., 2024. "Nonparametric estimation of stochastic frontier models with weak separability," Journal of Econometrics, Elsevier, vol. 238(2).
    17. Robert S. Pindyck, 2020. "COVID-19 and the Welfare Effects of Reducing Contagion," NBER Working Papers 27121, National Bureau of Economic Research, Inc.
    18. David Berger & Kyle Herkenhoff & Chengdai Huang & Simon Mongey, 2022. "Testing and Reopening in an SEIR Model," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 43, pages 1-21, January.
    19. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "Dynamic tracking with model-based forecasting for the spread of the COVID-19 pandemic," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).
    20. Cooper, Ian & Mondal, Argha & Antonopoulos, Chris G., 2020. "A SIR model assumption for the spread of COVID-19 in different communities," Chaos, Solitons & Fractals, Elsevier, vol. 139(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0241332. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.