Author
Listed:
- Nowell M Fine
- Sunil V Kalmady
- Weijie Sun
- Russ Greiner
- Jonathan G Howlett
- James A White
- Finlay A McAlister
- Justin A Ezekowitz
- Padma Kaul
Abstract
Aims: Patients visiting the emergency department (ED) or hospitalized for heart failure (HF) are at increased risk for subsequent adverse outcomes, however effective risk stratification remains challenging. We utilized a machine-learning (ML)-based approach to identify HF patients at risk of adverse outcomes after an ED visit or hospitalization using a large regional administrative healthcare data system. Methods and results: Patients visiting the ED or hospitalized with HF between 2002–2016 in Alberta, Canada were included. Outcomes of interest were 30-day and 1-year HF-related ED visits, HF hospital readmission or all-cause mortality. We applied a feature extraction method using deep feature synthesis from multiple sources of health data and compared performance of a gradient boosting algorithm (CatBoost) with logistic regression modelling. The area under receiver operating characteristic curve (AUC-ROC) was used to assess model performance. We included 50,630 patients with 93,552 HF ED visits/hospitalizations. At 30-day follow-up in the holdout validation cohort, the AUC-ROC for the combined endpoint of HF ED visit, HF hospital readmission or death for the Catboost and logistic regression models was 74.16 (73.18–75.11) versus 62.25 (61.25–63.18), respectively. At 1-year follow-up corresponding values were 76.80 (76.1–77.47) versus 69.52 (68.77–70.26), respectively. AUC-ROC values for the endpoint of all-cause death alone at 30-days and 1-year follow-up were 83.21 (81.83–84.41) versus 69.53 (67.98–71.18), and 85.73 (85.14–86.29) versus 69.40 (68.57–70.26), for the CatBoost and logistic regression models, respectively. Conclusions: ML-based modelling with deep feature synthesis provided superior risk stratification for HF patients at 30-days and 1-year follow-up after an ED visit or hospitalization using data from a large administrative regional healthcare system. Author summary: Patients visiting the emergency department (ED) or hospitalized for heart failure (HF) are at increased risk for subsequent adverse outcomes, however effective risk stratification remains challenging. Healthcare administrative or claims databases capture data on large, unselected cohorts of patients and can be used to monitor HF outcomes at a population-level. The primary objective of our study was to develop and validate a risk prediction ML model for 30-day and 1-year HF ED visits, hospital readmissions or death using administrative health data and evaluate its performance and potential utility from a large regional healthcare system. We used deep feature synthesis for automated feature engineering to pool patient information from multiple data sources and developed models using a gradient boosting-based ML algorithm (CatBoost). From a database that included over 50,000 HF patients, Catboost demonstrated superior prognostic utility and precision to both conventional logistic regression and other ML-based risk prediction modelling approaches for all study endpoints. This model may be of value to healthcare administrators for developing strategies to reduce adverse outcomes for hospitalized HF patients.
Suggested Citation
Nowell M Fine & Sunil V Kalmady & Weijie Sun & Russ Greiner & Jonathan G Howlett & James A White & Finlay A McAlister & Justin A Ezekowitz & Padma Kaul, 2024.
"Machine Learning For Risk Prediction After Heart Failure Emergency Department Visit or Hospital Admission Using Administrative Health Data,"
PLOS Digital Health, Public Library of Science, vol. 3(10), pages 1-17, October.
Handle:
RePEc:plo:pdig00:0000636
DOI: 10.1371/journal.pdig.0000636
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pdig00:0000636. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: digitalhealth (email available below). General contact details of provider: https://journals.plos.org/digitalhealth .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.