IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0303287.html
   My bibliography  Save this article

Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden

Author

Listed:
  • Josline Adhiambo Otieno
  • Jenny Häggström
  • David Darehed
  • Marie Eriksson

Abstract

Globally, stroke is the third-leading cause of mortality and disability combined, and one of the costliest diseases in society. More accurate predictions of stroke outcomes can guide healthcare organizations in allocating appropriate resources to improve care and reduce both the economic and social burden of the disease. We aim to develop and evaluate the performance and explainability of three supervised machine learning models and the traditional multinomial logistic regression (mLR) in predicting functional dependence and death three months after stroke, using routinely-collected data. This prognostic study included adult patients, registered in the Swedish Stroke Registry (Riksstroke) from 2015 to 2020. Riksstroke contains information on stroke care and outcomes among patients treated in hospitals in Sweden. Prognostic factors (features) included demographic characteristics, pre-stroke functional status, cardiovascular risk factors, medications, acute care, stroke type, and severity. The outcome was measured using the modified Rankin Scale at three months after stroke (a scale of 0–2 indicates independent, 3–5 dependent, and 6 dead). Outcome prediction models included support vector machines, artificial neural networks (ANN), eXtreme Gradient Boosting (XGBoost), and mLR. The models were trained and evaluated on 75% and 25% of the dataset, respectively. Model predictions were explained using SHAP values. The study included 102,135 patients (85.8% ischemic stroke, 53.3% male, mean age 75.8 years, and median NIHSS of 3). All models demonstrated similar overall accuracy (69%–70%). The ANN and XGBoost models performed significantly better than the mLR in classifying dependence with F1-scores of 0.603 (95% CI; 0.594–0.611) and 0.577 (95% CI; 0.568–0.586), versus 0.544 (95% CI; 0.545–0.563) for the mLR model. The factors that contributed most to the predictions were expectedly similar in the models, based on clinical knowledge. Our ANN and XGBoost models showed a modest improvement in prediction performance and explainability compared to mLR using routinely-collected data. Their improved ability to predict functional dependence may be of particular importance for the planning and organization of acute stroke care and rehabilitation.

Suggested Citation

  • Josline Adhiambo Otieno & Jenny Häggström & David Darehed & Marie Eriksson, 2024. "Developing machine learning models to predict multi-class functional outcomes and death three months after stroke in Sweden," PLOS ONE, Public Library of Science, vol. 19(5), pages 1-16, May.
  • Handle: RePEc:plo:pone00:0303287
    DOI: 10.1371/journal.pone.0303287
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0303287
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0303287&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0303287?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:plo:pmed00:0040296 is not listed on IDEAS
    2. Hamed Asadi & Richard Dowling & Bernard Yan & Peter Mitchell, 2014. "Machine Learning for Outcome Prediction of Acute Ischemic Stroke Post Intra-Arterial Therapy," PLOS ONE, Public Library of Science, vol. 9(2), pages 1-11, February.
    3. Agni Orfanoudaki & Emma Chesley & Christian Cadisch & Barry Stein & Amre Nouh & Mark J Alberts & Dimitris Bertsimas, 2020. "Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-20, May.
    4. Wenjuan Wang & Martin Kiik & Niels Peek & Vasa Curcin & Iain J Marshall & Anthony G Rudd & Yanzhong Wang & Abdel Douiri & Charles D Wolfe & Benjamin Bray, 2020. "A systematic review of machine learning models for predicting outcomes of stroke with structured data," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-16, June.
    5. Esra Zihni & Vince Istvan Madai & Michelle Livne & Ivana Galinovic & Ahmed A Khalil & Jochen B Fiebach & Dietmar Frey, 2020. "Opening the black box of artificial intelligence for clinical decision support: A study predicting stroke outcome," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dimitris Bertsimas & Agni Orfanoudaki & Rory B. Weiner, 2020. "Personalized treatment for coronary artery disease patients: a machine learning approach," Health Care Management Science, Springer, vol. 23(4), pages 482-506, December.
    2. Yuta Miyazaki & Michiyuki Kawakami & Kunitsugu Kondo & Masahiro Tsujikawa & Kaoru Honaga & Kanjiro Suzuki & Tetsuya Tsuji, 2023. "Improvement of predictive accuracies of functional outcomes after subacute stroke inpatient rehabilitation by machine learning models," PLOS ONE, Public Library of Science, vol. 18(5), pages 1-14, May.
    3. Marcel Lucas Chee & Marcus Eng Hock Ong & Fahad Javaid Siddiqui & Zhongheng Zhang & Shir Lynn Lim & Andrew Fu Wah Ho & Nan Liu, 2021. "Artificial Intelligence Applications for COVID-19 in Intensive Care and Emergency Settings: A Systematic Review," IJERPH, MDPI, vol. 18(9), pages 1-15, April.
    4. Ching-Heng Lin & Ya-Wen Kuo & Yen-Chu Huang & Meng Lee & Yi-Wei Huang & Chang-Fu Kuo & Jiann-Der Lee, 2023. "Development and Validation of a Novel Score for Predicting Long-Term Mortality after an Acute Ischemic Stroke," IJERPH, MDPI, vol. 20(4), pages 1-12, February.
    5. Wenjuan Wang & Martin Kiik & Niels Peek & Vasa Curcin & Iain J Marshall & Anthony G Rudd & Yanzhong Wang & Abdel Douiri & Charles D Wolfe & Benjamin Bray, 2020. "A systematic review of machine learning models for predicting outcomes of stroke with structured data," PLOS ONE, Public Library of Science, vol. 15(6), pages 1-16, June.
    6. Yizhao Ni & Kathleen Alwell & Charles J Moomaw & Daniel Woo & Opeolu Adeoye & Matthew L Flaherty & Simona Ferioli & Jason Mackey & Felipe De Los Rios La Rosa & Sharyl Martini & Pooja Khatri & Dawn Kle, 2018. "Towards phenotyping stroke: Leveraging data from a large-scale epidemiological study to detect stroke diagnosis," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-20, February.
    7. Nima Safaei & Babak Safaei & Seyedhouman Seyedekrami & Mojtaba Talafidaryani & Arezoo Masoud & Shaodong Wang & Qing Li & Mahdi Moqri, 2022. "E-CatBoost: An efficient machine learning framework for predicting ICU mortality using the eICU Collaborative Research Database," PLOS ONE, Public Library of Science, vol. 17(5), pages 1-33, May.
    8. Hebatullah Abdulazeem & Sera Whitelaw & Gunther Schauberger & Stefanie J Klug, 2023. "A systematic review of clinical health conditions predicted by machine learning diagnostic and prognostic models trained or validated using real-world primary health care data," PLOS ONE, Public Library of Science, vol. 18(9), pages 1-25, September.
    9. Aaron E Kornblith & Chandan Singh & Gabriel Devlin & Newton Addo & Christian J Streck & James F Holmes & Nathan Kuppermann & Jacqueline Grupp-Phelan & Jeffrey Fineman & Atul J Butte & Bin Yu, 2022. "Predictability and stability testing to assess clinical decision instrument performance for children after blunt torso trauma," PLOS Digital Health, Public Library of Science, vol. 1(8), pages 1-16, August.
    10. Yao Tong & Beilei Lin & Gang Chen & Zhenxiang Zhang, 2022. "Predicting Continuity of Asthma Care Using a Machine Learning Model: Retrospective Cohort Study," IJERPH, MDPI, vol. 19(3), pages 1-18, January.
    11. Esra Zihni & Vince Istvan Madai & Michelle Livne & Ivana Galinovic & Ahmed A Khalil & Jochen B Fiebach & Dietmar Frey, 2020. "Opening the black box of artificial intelligence for clinical decision support: A study predicting stroke outcome," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-15, April.
    12. Diana T Mosa & Amena Mahmoud & John Zaki & Shaymaa E Sorour & Shaker El-Sappagh & Tamer Abuhmed, 2023. "Henry gas solubility optimization double machine learning classifier for neurosurgical patients," PLOS ONE, Public Library of Science, vol. 18(5), pages 1-22, May.
    13. Wieslaw L Nowinski & Varsha Gupta & Guoyu Qian & Wojciech Ambrosius & Radoslaw Kazmierski, 2014. "Population-Based Stroke Atlas for Outcome Prediction: Method and Preliminary Results for Ischemic Stroke from CT," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-11, August.
    14. Vieira, Bruno Hebling & Pamplona, Gustavo Santo Pedro & Fachinello, Karim & Silva, Alice Kamensek & Foss, Maria Paula & Salmon, Carlos Ernesto Garrido, 2022. "On the prediction of human intelligence from neuroimaging: A systematic review of methods and reporting," Intelligence, Elsevier, vol. 93(C).
    15. Ozlem Cosgun & Amjad Umar & Dursun Delen, 2024. "Operational assessment of nursing homes at times of pandemic: an integrated DEA and machine learning approach," Operational Research, Springer, vol. 24(4), pages 1-40, December.
    16. Khishigsuren Davagdorj & Van Huy Pham & Nipon Theera-Umpon & Keun Ho Ryu, 2020. "XGBoost-Based Framework for Smoking-Induced Noncommunicable Disease Prediction," IJERPH, MDPI, vol. 17(18), pages 1-22, September.
    17. Indy Man Kit Ho & Kai Yuen Cheong & Anthony Weldon, 2021. "Predicting student satisfaction of emergency remote learning in higher education during COVID-19 using machine learning techniques," PLOS ONE, Public Library of Science, vol. 16(4), pages 1-27, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0303287. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.