IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v328y2023i1d10.1007_s10479-022-04984-x.html
   My bibliography  Save this article

Using machine learning in prediction of ICU admission, mortality, and length of stay in the early stage of admission of COVID-19 patients

Author

Listed:
  • Sara Saadatmand

    (Persian Gulf University)

  • Khodakaram Salimifard

    (Persian Gulf University)

  • Reza Mohammadi

    (University of Amsterdam)

  • Alex Kuiper

    (University of Amsterdam)

  • Maryam Marzban

    (Bushehr University of Medical Science)

  • Akram Farhadi

    (Bushehr University of Medical Science)

Abstract

The recent COVID-19 pandemic has affected health systems across the world. Especially, Intensive Care Units (ICUs) have played a pivotal role in the treatment of critically-ill patients. At the same time however, the increasing number of admissions due to the vast prevalence of the virus have caused several problems for ICU wards such as overburdening of staff and shortages of medical resources. These issues might have affected the quality of healthcare services provided directly impacting a patient’s survival. The objective of this research is to leverage Machine Learning (ML) on hospital data in order to support hospital managers and practitioners with the treatment of COVID-19 patients. This is accomplished by providing more detailed inference about a patient’s likelihood of ICU admission, mortality and in case of hospitalization the length of stay (LOS). In this pursuit, the outcome variables are in three separate models predicted by five different ML algorithms: eXtreme Gradient Boosting (XGB), K-Nearest Neighbor (KNN), Random Forest (RF), bagged-CART (b-CART), and LogitBoost (LB). With the exception of KNN, the studied models show good predictive capabilities when evaluating relevant accuracy scores, such as area under the curve. By implementing an ensemble stacking approach (either a Neural Net or a General Linear Model) on top of the aforementioned ML algorithms the performance is further boosted. Ultimately, for the prediction of admission to the ICU, the ensemble stacking via a Neural Net achieved the best result with an accuracy of over 95%. For mortality at the ICU, the vanilla XGB performed slightly better (1% difference with the meta-model). To predict large length of stays both ensemble stacking approaches yield comparable results. Besides it direct implications for managing COVID-19 patients, the approach presented serves as an example how data can be employed in future pandemics or crises.

Suggested Citation

  • Sara Saadatmand & Khodakaram Salimifard & Reza Mohammadi & Alex Kuiper & Maryam Marzban & Akram Farhadi, 2023. "Using machine learning in prediction of ICU admission, mortality, and length of stay in the early stage of admission of COVID-19 patients," Annals of Operations Research, Springer, vol. 328(1), pages 1043-1071, September.
  • Handle: RePEc:spr:annopr:v:328:y:2023:i:1:d:10.1007_s10479-022-04984-x
    DOI: 10.1007/s10479-022-04984-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-022-04984-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-022-04984-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kursa, Miron B. & Rudnicki, Witold R., 2010. "Feature Selection with the Boruta Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 36(i11).
    2. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    3. Cohen, Jennifer & Rodgers, Yana, 2020. "Contributing Factors to Personal Protective Equipment Shortages during the COVID-19 Pandemic," MPRA Paper 103496, University Library of Munich, Germany.
    4. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    5. Rocío Aznar-Gimeno & Luis M. Esteban & Gorka Labata-Lezaun & Rafael del-Hoyo-Alonso & David Abadia-Gallego & J. Ramón Paño-Pardo & M. José Esquillor-Rodrigo & Ángel Lanas & M. Trinidad Serrano, 2021. "A Clinical Decision Web to Predict ICU Admission or Death for Patients Hospitalised with COVID-19 Using Machine Learning Algorithms," IJERPH, MDPI, vol. 18(16), pages 1-20, August.
    6. Mohammed Akour & Izzat Alsmadi & Iyad Alazzam, 2017. "Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(1), pages 1-16.
    7. Kuhn, Max, 2008. "Building Predictive Models in R Using the caret Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 28(i05).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arjan S. Gosal & Janine A. McMahon & Katharine M. Bowgen & Catherine H. Hoppe & Guy Ziv, 2021. "Identifying and Mapping Groups of Protected Area Visitors by Environmental Awareness," Land, MDPI, vol. 10(6), pages 1-14, May.
    2. Manuel S. González Canché, 2017. "Financial Benefits of Rapid Student Loan Repayment: An Analytic Framework Employing Two Decades of Data," The ANNALS of the American Academy of Political and Social Science, , vol. 671(1), pages 154-182, May.
    3. Francesco Sartor & Jonathan P. Moore & Hans-Peter Kubis, 2021. "Plasma Interleukin-10 and Cholesterol Levels May Inform about Interdependences between Fitness and Fatness in Healthy Individuals," IJERPH, MDPI, vol. 18(4), pages 1-19, February.
    4. Faisal Alsayegh & Moh A Alkhamis & Fatima Ali & Sreeja Attur & Nicholas M Fountain-Jones & Mohammad Zubaid, 2022. "Anemia or other comorbidities? using machine learning to reveal deeper insights into the drivers of acute coronary syndromes in hospital admitted patients," PLOS ONE, Public Library of Science, vol. 17(1), pages 1-15, January.
    5. Burns, Christopher & Prager, Daniel & Ghosh, Sujit & Goodwin, Barry, 2015. "Imputing for Missing Data in the ARMS Household Section: A Multivariate Imputation Approach," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205291, Agricultural and Applied Economics Association.
    6. Franck M. Ramaharo & Michael Fitiavana Randriamifidy, 2023. "Determinants of renewable energy consumption in Madagascar: Evidence from feature selection algorithms," Working Papers hal-04262240, HAL.
    7. Nengsih Titin Agustin & Bertrand Frédéric & Maumy-Bertrand Myriam & Meyer Nicolas, 2019. "Determining the number of components in PLS regression on incomplete data set," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 18(6), pages 1-28, December.
    8. Carolina Alves Costa Silva & Gianmarco Piccinno & Déborah Suissa & Mélanie Bourgin & Gerty Schreibelt & Sylvère Durand & Roxanne Birebent & Marine Fidelle & Cissé Sow & Fanny Aprahamian & Paolo Manghi, 2024. "Influence of microbiota-associated metabolic reprogramming on clinical outcome in patients with melanoma from the randomized adjuvant dendritic cell-based MIND-DC trial," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    9. Svetlana Kresova & Sebastian Hess, 2022. "Identifying the Determinants of Regional Raw Milk Prices in Russia Using Machine Learning," Agriculture, MDPI, vol. 12(7), pages 1-18, July.
    10. Zazueta, Jorge & Zazueta-Hernández, Jorge & Heredia, Andrea Chavez, 2023. "Support Vector Machines and Bankruptcy Prediction," SocArXiv 7z24k, Center for Open Science.
    11. Reza Arabi Belaghi & Joseph Beyene & Sarah D McDonald, 2021. "Prediction of preterm birth in nulliparous women using logistic regression and machine learning," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-22, June.
    12. Kresova, Svetlana & Hess, Sebastian, 2021. "Determinants of Regional Raw Milk Prices in Russia," 2021 Conference, August 17-31, 2021, Virtual 315064, International Association of Agricultural Economists.
    13. Nanna Munck & Patrick Murigu Kamau Njage & Pimlapas Leekitcharoenphon & Eva Litrup & Tine Hald, 2020. "Application of Whole‐Genome Sequences and Machine Learning in Source Attribution of Salmonella Typhimurium," Risk Analysis, John Wiley & Sons, vol. 40(9), pages 1693-1705, September.
    14. Tanzeela Khalid & Raphael Aggio & Paul White & Ben De Lacy Costello & Raj Persad & Huda Al-Kateb & Peter Jones & Chris S Probert & Norman Ratcliffe, 2015. "Urinary Volatile Organic Compounds for the Detection of Prostate Cancer," PLOS ONE, Public Library of Science, vol. 10(11), pages 1-15, November.
    15. Zazueta, Jorge & Heredia, Andrea Chavez & Zazueta-Hernández, Jorge, 2021. "Endogenous Prediction of Bankruptcy using a Support Vector Machine," SocArXiv ehpt7, Center for Open Science.
    16. Kresova, Svetlana & Hess, Sebastian, 2021. "Determinants of Regional Raw Milk Prices in Russia," 61st Annual Conference, Berlin, Germany, September 22-24, 2021 317051, German Association of Agricultural Economists (GEWISOLA).
    17. Gehan A. Mousa & Elsayed A. H. Elamir & Khaled Hussainey, 2022. "Using machine learning methods to predict financial performance: Does disclosure tone matter?," International Journal of Disclosure and Governance, Palgrave Macmillan, vol. 19(1), pages 93-112, March.
    18. Carlos Família & Sarah R Dennison & Alexandre Quintas & David A Phoenix, 2015. "Prediction of Peptide and Protein Propensity for Amyloid Formation," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-16, August.
    19. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    20. Abhilash Bandam & Eedris Busari & Chloi Syranidou & Jochen Linssen & Detlef Stolten, 2022. "Classification of Building Types in Germany: A Data-Driven Modeling Approach," Data, MDPI, vol. 7(4), pages 1-23, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:328:y:2023:i:1:d:10.1007_s10479-022-04984-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.