Author
Listed:
- Bikash Sadhukhan
(Department of Computer Science and Engineering, Techno International New Town, Kolkata 700156, India)
- Pratick Gupta
(Department of Computer Science and Engineering, Techno International New Town, Kolkata 700156, India)
- Atulya Narayan
(Department of Computer Science and Engineering, Techno International New Town, Kolkata 700156, India)
- Akshay Kumar Mourya
(Department of Computer Science and Engineering, Techno International New Town, Kolkata 700156, India)
- Shivam Kumar
(Department of Computer Science and Engineering, Techno International New Town, Kolkata 700156, India)
Abstract
Cardiovascular diseases are prominent contributors to mortality worldwide, and timely identification is crucial for enhancing patient prognosis. The protracted and exhaustive diagnostic procedures that result in delayed diagnosis can culminate in precarious circumstances that are difficult or impossible to manage. The utilisation of machine learning (ML) methodologies has the potential to facilitate the timely prediction of heart disease based on specific medical reports, thereby affording individuals the convenience of conducting such assessments from the comfort of their own homes. Using a dataset consisting of medical records and clinical attributes, ten models were evaluated, including decision tree, K-nearest neighbours, gradient boosting, random forest, AdaBoost, support vector machine, logistic regression, naive Bayes, a hypertuned gradient boosting model, and a StackingCV ensemble model. Utilising performance metrics such as accuracy, precision, recall, F1-scores, and the ROC-AUC, their predictive capabilities were evaluated. The random forest classifier achieved an accuracy of 0.94, demonstrating its high discriminatory power in identifying cases of cardiovascular disease. With an accuracy of 0.91, the K-nearest neighbours model demonstrated its potential for accurate classification. Intriguingly, the hypertuned gradient boosting model significantly outperformed the baseline model, achieving an impressive accuracy of 0.96. Additionally, the StackingCV ensemble model demonstrated superior accuracy, recall, F1-scores, and an ROC–AUC of 0.99, surpassing all the individual classifiers. These results demonstrate the effectiveness of ML algorithms in the detection of heart disease. The random forest classifier, the hypertuned gradient boosting model, and the StackingCV ensemble models demonstrate high accuracy and show promise for implementation in clinical settings.
Suggested Citation
Bikash Sadhukhan & Pratick Gupta & Atulya Narayan & Akshay Kumar Mourya & Shivam Kumar, 2025.
"Empirical Analysis of Machine Learning and Stacking Ensemble Methods for Heart Disease Detection,"
Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 24(04), pages 1-26, August.
Handle:
RePEc:wsi:jikmxx:v:24:y:2025:i:04:n:s0219649225500285
DOI: 10.1142/S0219649225500285
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:jikmxx:v:24:y:2025:i:04:n:s0219649225500285. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/jikm/jikm.shtml .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.