IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0309383.html

A machine learning-based prediction of hospital mortality in mechanically ventilated ICU patients

Author

Listed:
  • Hexin Li
  • Negin Ashrafi
  • Chris Kang
  • Guanlan Zhao
  • Yubing Chen
  • Maryam Pishgar

Abstract

Background: Mechanical ventilation (MV) is vital for critically ill ICU patients but carries significant mortality risks. This study aims to develop a predictive model to estimate hospital mortality among MV patients, utilizing comprehensive health data to assist ICU physicians with early-stage alerts. Methods: We developed a Machine Learning (ML) framework to predict hospital mortality in ICU patients receiving MV. Using the MIMIC-III database, we identified 25,202 eligible patients through ICD-9 codes. We employed backward elimination and the Lasso method, selecting 32 features based on clinical insights and literature. Data preprocessing included eliminating columns with over 90% missing data and using mean imputation for the remaining missing values. To address class imbalance, we used the Synthetic Minority Over-sampling Technique (SMOTE). We evaluated several ML models, including CatBoost, XGBoost, Decision Tree, Random Forest, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Logistic Regression, using a 70/30 train-test split. The CatBoost model was chosen for its superior performance in terms of accuracy, precision, recall, F1-score, AUROC metrics, and calibration plots. Results: The study involved a cohort of 25,202 patients on MV. The CatBoost model attained an AUROC of 0.862, an increase from an initial AUROC of 0.821, which was the best reported in the literature. It also demonstrated an accuracy of 0.789, an F1-score of 0.747, and better calibration, outperforming other models. These improvements are due to systematic feature selection and the robust gradient boosting architecture of CatBoost. Conclusion: The preprocessing methodology significantly reduced the number of relevant features, simplifying computational processes, and identified critical features previously overlooked. Integrating these features and tuning the parameters, our model demonstrated strong generalization to unseen data. This highlights the potential of ML as a crucial tool in ICUs, enhancing resource allocation and providing more personalized interventions for MV patients.

Suggested Citation

  • Hexin Li & Negin Ashrafi & Chris Kang & Guanlan Zhao & Yubing Chen & Maryam Pishgar, 2024. "A machine learning-based prediction of hospital mortality in mechanically ventilated ICU patients," PLOS ONE, Public Library of Science, vol. 19(9), pages 1-16, September.
  • Handle: RePEc:plo:pone00:0309383
    DOI: 10.1371/journal.pone.0309383
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0309383
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0309383&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0309383?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Naomi George & Edward Moseley & Rene Eber & Jennifer Siu & Mathew Samuel & Jonathan Yam & Kexin Huang & Leo Anthony Celi & Charlotta Lindvall, 2021. "Deep learning to predict long-term mortality in patients requiring 7 days of mechanical ventilation," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-13, June.
    2. Christopher Martin Sauer & David Sasson & Kenneth E Paik & Ned McCague & Leo Anthony Celi & Iván Sánchez Fernández & Ben M W Illigens, 2018. "Feature selection and prediction of treatment failure in tuberculosis," PLOS ONE, Public Library of Science, vol. 13(11), pages 1-14, November.
    3. Nima Safaei & Babak Safaei & Seyedhouman Seyedekrami & Mojtaba Talafidaryani & Arezoo Masoud & Shaodong Wang & Qing Li & Mahdi Moqri, 2022. "E-CatBoost: An efficient machine learning framework for predicting ICU mortality using the eICU Collaborative Research Database," PLOS ONE, Public Library of Science, vol. 17(5), pages 1-33, May.
    4. Limin Yu & Alexandra Halalau & Bhavinkumar Dalal & Amr E Abbas & Felicia Ivascu & Mitual Amin & Girish B Nair, 2021. "Machine learning methods to predict mechanical ventilation and mortality in patients with COVID-19," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-18, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Anitha T. & Gopu G. & Arun Mozhi Devan P. & Maher Assaad, 2024. "Machine learning algorithm for ventilator mode selection, pressure and volume control," PLOS ONE, Public Library of Science, vol. 19(3), pages 1-34, March.
    2. Haron W Gichuhi & Mark Magumba & Manish Kumar & Roy William Mayega, 2023. "A machine learning approach to explore individual risk factors for tuberculosis treatment non-adherence in Mukono district," PLOS Global Public Health, Public Library of Science, vol. 3(7), pages 1-20, July.
    3. Eisa Nazar & Hossein Baghishani & Hassan Doosti & Vahid Ghavami & Ehsan Aryan & Mahshid Nasehi & Saeid Sharafi & Habibollah Esmaily & Jamshid Yazdani Charati, 2020. "Bayesian Spatial Survival Analysis of Duration to Cure among New Smear-Positive Pulmonary Tuberculosis (PTB) Patients in Iran, during 2011–2018," IJERPH, MDPI, vol. 18(1), pages 1-18, December.
    4. Miguel Contreras & Brandon Silva & Benjamin Shickel & Andrea Davidson & Tezcan Ozrazgat-Baslanti & Yuanfang Ren & Ziyuan Guan & Jeremy Balch & Jiaqing Zhang & Sabyasachi Bandyopadhyay & Tyler Loftus &, 2025. "Real-time prediction of intensive care unit patient acuity and therapy requirements using state-space modelling," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    5. Jihoon Moon & Muazzam Maqsood & Dayeong So & Sung Wook Baik & Seungmin Rho & Yunyoung Nam, 2024. "Advancing ensemble learning techniques for residential building electricity consumption forecasting: Insight from explainable artificial intelligence," PLOS ONE, Public Library of Science, vol. 19(11), pages 1-36, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0309383. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.