Author
Listed:
- Md Merajul Islam
- Sujit Kumar
- Md A Salam
- Dulal Chandra Roy
- Md Rezaul Karim
Abstract
Background: Cardiovascular disease (CVD) encompasses a group of disorders that affect the heart and blood vessels, making it one of the leading causes of death globally, including in Bangladesh. Applying predictive modeling for the early identification and detection of CVD holds significant promise for saving lives by enhancing prediction precision through machine learning algorithms. Therefore, this study aimed to predict high-risk individuals for CVD using machine learning algorithms and identify its influencing predictors by association mining rules among individuals in Bangladesh. Materials and methods: This study utilized the most recent Bangladesh Demographic and Health Survey (BDHS) 2022 data, which encompassed 2,221 respondents. A Boruta-based feature selection method is employed to determine the important features associated with the high risk of CVD. Different machine learning algorithms, including logistic regression, Naïve Bayes, artificial neural network, random forest, and extreme gradient boosting (XGB), are adopted to predict the high-risk individuals for CVD in the training dataset. The predictive performance of the models is evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC) in the testing set. Additionally, the most significant rules are analyzed using the association mining technique to identify the influencing predictors of high risk of CVD. Results: The Boruta method indicated that age, residence, marital status, wealth, having an air conditioner (AC), and body mass index (BMI) are important predictors of high risk of CVD. The XGB-based predictive model achieves impressive performance compared to other models, with an accuracy of 68.22%, precision of 69.70%, F1-score of 79.54%, and AUC of 0.721. The association rules identified that being aged 65 or older, living in an urban area, having the richest wealth status, having AC, and being widowed are the influencing predictors of high risk of CVD. Conclusions: This study emphasizes the potential of XGB in predicting high-risk individuals for CVD and enhances the investigation of key factors contributing to CVD risk in this population, thereby facilitating the development of targeted prevention strategies that can effectively mitigate the high CVD risk.
Suggested Citation
Md Merajul Islam & Sujit Kumar & Md A Salam & Dulal Chandra Roy & Md Rezaul Karim, 2025.
"Cardiovascular risk prediction and influencing predictors identification among Bangladeshi individuals using machine learning algorithms and association rule mining,"
PLOS ONE, Public Library of Science, vol. 20(10), pages 1-18, October.
Handle:
RePEc:plo:pone00:0333913
DOI: 10.1371/journal.pone.0333913
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0333913. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.