An interpretable LightGBM model for predicting coronary heart disease: Enhancing clinical decision-making with machine learning

My bibliography Save this article

An interpretable LightGBM model for predicting coronary heart disease: Enhancing clinical decision-making with machine learning

Author

Listed:

Lang Deng
Kongjie Lu
Huanhuan Hu

Registered:

Abstract

Background: Coronary Heart Disease (CHD) is one of the major burdens of cardiovascular diseases worldwide. Traditional diagnostic methods, such as coronary angiography and electrocardiogram, face challenges including high costs, subjectivity, and high misdiagnosis rates. To address these issues, this study proposes a prediction framework for CHD based on the LightGBM algorithm, aiming to improve the accuracy and interpretability of CHD risk prediction. Methods: This study utilized three publicly available datasets: BRFSS_2015, Framingham, and Z-Alizadeh Sani. The BRFSS_2015 dataset was used for model training, while the Framingham and Z-Alizadeh Sani datasets were employed for validation. Data preprocessing included cleaning, feature engineering, and handling missing values. The LightGBM model was selected for its efficiency and performance, and SHAP (SHapley Additive exPlanations) values were used to enhance model interpretability. Model performance was evaluated using metrics such as accuracy, precision, recall, F1-score, and AUROC. A CHD scoring system was developed based on the model’s predictions to assist clinicians in risk assessment. Results: The LightGBM model demonstrated excellent performance, achieving an accuracy of 90.60% and an AUROC of 81.06% on the BRFSS_2015 dataset. After parameter tuning, the model’s accuracy improved to 90.61%, and the AUROC increased to 81.11%. On the Framingham dataset, the accuracy improved from 83.96% to 85.26%, and the AUROC increased from 62.86% to 67.37%. On the Z-Alizadeh Sani dataset, the accuracy improved from 78.69% to 80.33%, and the precision increased from 74.40% to 76.36%. Conclusions: SHAP analysis revealed that age, smoking status, diabetes, hypertension, and high cholesterol were the most influential features in predicting CHD risk. The developed CHD scoring system provided a user-friendly tool for clinicians to assess patient risk levels effectively.

Suggested Citation

Lang Deng & Kongjie Lu & Huanhuan Hu, 2025. "An interpretable LightGBM model for predicting coronary heart disease: Enhancing clinical decision-making with machine learning," PLOS ONE, Public Library of Science, vol. 20(9), pages 1-26, September.

Handle: RePEc:plo:pone00:0330377
DOI: 10.1371/journal.pone.0330377

Download full text from publisher

References listed on IDEAS

Zhuye Jie & Huihua Xia & Shi-Long Zhong & Qiang Feng & Shenghui Li & Suisha Liang & Huanzi Zhong & Zhipeng Liu & Yuan Gao & Hui Zhao & Dongya Zhang & Zheng Su & Zhiwei Fang & Zhou Lan & Junhua Li & Li, 2017. "The gut microbiome in atherosclerotic cardiovascular disease," Nature Communications, Nature, vol. 8(1), pages 1-12, December.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Qi Su & Qin Liu & Raphaela Iris Lau & Jingwan Zhang & Zhilu Xu & Yun Kit Yeoh & Thomas W. H. Leung & Whitney Tang & Lin Zhang & Jessie Q. Y. Liang & Yuk Kam Yau & Jiaying Zheng & Chengyu Liu & Mengjin, 2022. "Faecal microbiome-based machine learning for multi-class disease diagnosis," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
Wanting Dong & Xinyue Fan & Yaqiong Guo & Siyi Wang & Shulei Jia & Na Lv & Tao Yuan & Yuanlong Pan & Yong Xue & Xi Chen & Qian Xiong & Ruifu Yang & Weigang Zhao & Baoli Zhu, 2024. "An expanded database and analytical toolkit for identifying bacterial virulence factors and their associations with chronic diseases," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
Martin Stocker & Claus Klingenberg & Lars Navér & Viveka Nordberg & Alberto Berardi & Salhab el Helou & Gerhard Fusch & Joseph M. Bliss & Dirk Lehnick & Varvara Dimopoulou & Nicholas Guerina & Joanna , 2023. "Less is more: Antibiotics at the beginning of life," Nature Communications, Nature, vol. 14(1), pages 1-9, December.
Braden T. Tierney & Jonathan Foox & Krista A. Ryon & Daniel Butler & Namita Damle & Benjamin G. Young & Christopher Mozsary & Kristina M. Babler & Xue Yin & Yamina Carattini & David Andrews & Alexande, 2024. "Towards geospatially-resolved public-health surveillance via wastewater sequencing," Nature Communications, Nature, vol. 15(1), pages 1-18, December.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0330377. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

An interpretable LightGBM model for predicting coronary heart disease: Enhancing clinical decision-making with machine learning

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data