IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0315865.html
   My bibliography  Save this article

Optimizing hypertension prediction using ensemble learning approaches

Author

Listed:
  • Isteaq Kabir Sifat
  • Md Kaderi Kibria

Abstract

Hypertension (HTN) prediction is critical for effective preventive healthcare strategies. This study investigates how well ensemble learning techniques work to increase the accuracy of HTN prediction models. Utilizing a dataset of 612 participants from Ethiopia, which includes 27 features potentially associated with HTN risk, we aimed to enhance predictive performance over traditional single-model methods. A multi-faceted feature selection approach was employed, incorporating Boruta, Lasso Regression, Forward and Backward Selection, and Random Forest feature importance, and found 13 common features that were considered for prediction. Five machine learning (ML) models such as logistic regression (LR), artificial neural network (ANN), random forest (RF), extreme gradient boosting (XGB), light gradient boosting machine (LGBM), and a stacking ensemble model were trained using selected features to predict HTN. The models’ performance on the testing set was evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC). Additionally, SHapley Additive exPlanations (SHAP) was utilized to examine the impact of individual features on the models’ predictions and identify the most important risk factors for HTN. The stacking ensemble model emerged as the most effective approach for predicting HTN risk, achieving an accuracy of 96.32%, precision of 95.48%, recall of 97.51%, F1-score of 96.48%, and an AUC of 0.971. SHAP analysis of the stacking model identified weight, drinking habits, history of hypertension, salt intake, age, diabetes, BMI, and fat intake as the most significant and interpretable risk factors for HTN. Our results demonstrate significant advancements in predictive accuracy and robustness, highlighting the potential of ensemble learning as a pivotal tool in healthcare analytics. This research contributes to ongoing efforts to optimize HTN prediction models, ultimately supporting early intervention and personalized healthcare management.

Suggested Citation

  • Isteaq Kabir Sifat & Md Kaderi Kibria, 2024. "Optimizing hypertension prediction using ensemble learning approaches," PLOS ONE, Public Library of Science, vol. 19(12), pages 1-17, December.
  • Handle: RePEc:plo:pone00:0315865
    DOI: 10.1371/journal.pone.0315865
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0315865
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0315865&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0315865?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Weizhang Liang & Suizhi Luo & Guoyan Zhao & Hao Wu, 2020. "Predicting Hard Rock Pillar Stability Using GBDT, XGBoost, and LightGBM Algorithms," Mathematics, MDPI, vol. 8(5), pages 1-17, May.
    2. Thomas Mroz & Michael Griffin & Richard Cartabuke & Luke Laffin & Giavanna Russo-Alvarez & George Thomas & Nicholas Smedira & Thad Meese & Michael Shost & Ghaith Habboub, 2024. "Predicting hypertension control using machine learning," PLOS ONE, Public Library of Science, vol. 19(3), pages 1-14, March.
    3. Mohammad Ziaul Islam Chowdhury & Iffat Naeem & Hude Quan & Alexander A Leung & Khokan C Sikdar & Maeve O’Beirne & Tanvir C Turin, 2022. "Prediction of hypertension using traditional regression and machine learning models: A systematic review and meta-analysis," PLOS ONE, Public Library of Science, vol. 17(4), pages 1-30, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sezer Kanbul & Idris Adamu & Yakubu Bala Mohammed, 2024. "A Global Outlook on AI-Predicted Impacts of ChatGPT on Contemporary Education," SAGE Open, , vol. 14(3), pages 21582440241, August.
    2. Yanqi Gong & Chunyi Wang & Hongxuan Fu & Sandylove Afrane & Pingjian Yang & Jian-Lin Chen & Guozhu Mao, 2025. "Spatiotemporal Analysis and Prediction of Avian Migration Under Climate Change," Sustainability, MDPI, vol. 17(7), pages 1-27, March.
    3. Jin, Scarlett T. & Sui, Daniel Z., 2024. "A comparative analysis of the spatial determinants of e-bike and e-scooter sharing link flows," Journal of Transport Geography, Elsevier, vol. 119(C).
    4. Yu, Ruyang & Zhang, Kai & Ramasubramanian, Brindha & Jiang, Shu & Ramakrishna, Seeram & Tang, Yuhang, 2024. "Ensemble learning for predicting average thermal extraction load of a hydrothermal geothermal field: A case study in Guanzhong Basin, China," Energy, Elsevier, vol. 296(C).
    5. Xizi Wang & Yakun Ma & Guangwei Hu, 2024. "Mobile Platforms as the Alleged Culprit for Work–Life Imbalance: A Data-Driven Method Using Co-Occurrence Network and Explainable AI Framework," Sustainability, MDPI, vol. 16(18), pages 1-22, September.
    6. Jia, Liangyuan & Shao, Wanyun & Wang, Jingjing & Qian, Yingying & Chen, Yingquan & Yang, Qingchun, 2024. "Machine learning-aided prediction of bio-BTX and olefins production from zeolite-catalyzed biomass pyrolysis," Energy, Elsevier, vol. 306(C).
    7. Md Merajul Islam & Md Jahangir Alam & Md Maniruzzaman & N A M Faisal Ahmed & Md Sujan Ali & Md Jahanur Rahman & Dulal Chandra Roy, 2023. "Predicting the risk of hypertension using machine learning algorithms: A cross sectional study in Ethiopia," PLOS ONE, Public Library of Science, vol. 18(8), pages 1-20, August.
    8. Leilei Liu & Guoyan Zhao & Weizhang Liang, 2023. "Slope Stability Prediction Using k -NN-Based Optimum-Path Forest Approach," Mathematics, MDPI, vol. 11(14), pages 1-31, July.
    9. Babek Erdebilli & Burcu Devrim-İçtenbaş, 2022. "Ensemble Voting Regression Based on Machine Learning for Predicting Medical Waste: A Case from Turkey," Mathematics, MDPI, vol. 10(14), pages 1-16, July.
    10. Ji, Shujuan & Wang, Xin & Lyu, Tao & Liu, Xiaojie & Wang, Yuanqing & Heinen, Eva & Sun, Zhenwei, 2022. "Understanding cycling distance according to the prediction of the XGBoost and the interpretation of SHAP: A non-linear and interaction effect analysis," Journal of Transport Geography, Elsevier, vol. 103(C).
    11. Ning Li & Masoud Zare & Congke Yi & Rafael Jimenez, 2022. "Stability Risk Assessment of Underground Rock Pillars Using Logistic Model Trees," IJERPH, MDPI, vol. 19(4), pages 1-19, February.
    12. Orel Babayoff & Onn Shehory & Meishar Shahoha & Ruth Sasportas & Ahuva Weiss-Meilik, 2022. "Surgery duration: Optimized prediction and causality analysis," PLOS ONE, Public Library of Science, vol. 17(8), pages 1-18, August.
    13. Yiheng Li & Weidong Chen, 2020. "A Comparative Performance Assessment of Ensemble Learning for Credit Scoring," Mathematics, MDPI, vol. 8(10), pages 1-19, October.
    14. Jin, Scarlett T. & Wang, Lei & Sui, Daniel, 2023. "How the built environment affects E-scooter sharing link flows: A machine learning approach," Journal of Transport Geography, Elsevier, vol. 112(C).
    15. Chuanmin Mi & Mingzhu Li & Annisa Fitria Wulandari, 2024. "Predicting video views of web series based on comment sentiment analysis and improved stacking ensemble model," Electronic Commerce Research, Springer, vol. 24(4), pages 2637-2664, December.
    16. Shaohan Zhang & Shucheng Tan & Yongqi Sun & Duanyu Ding & Wei Yang, 2024. "Risk Mapping of Geological Hazards in Plateau Mountainous Areas Based on Multisource Remote Sensing Data Extraction and Machine Learning (Fuyuan, China)," Land, MDPI, vol. 13(9), pages 1-25, August.
    17. Mahmood Ahmad & Suraparb Keawsawasvong & Mohd Rasdan Bin Ibrahim & Muhammad Waseem & Kazem Reza Kashyzadeh & Mohanad Muayad Sabri Sabri, 2022. "Novel Approach to Predicting Soil Permeability Coefficient Using Gaussian Process Regression," Sustainability, MDPI, vol. 14(14), pages 1-15, July.
    18. Niaz Muhammad Shahani & Xigui Zheng & Xiaowei Guo & Xin Wei, 2022. "Machine Learning-Based Intelligent Prediction of Elastic Modulus of Rocks at Thar Coalfield," Sustainability, MDPI, vol. 14(6), pages 1-24, March.
    19. Jin, Tanhua & Cheng, Long & Liu, Zhicheng & Cao, Jun & Huang, Haosheng & Witlox, Frank, 2022. "Nonlinear public transit accessibility effects on housing prices: Heterogeneity across price segments," Transport Policy, Elsevier, vol. 117(C), pages 48-59.
    20. Yi-Hsueh Liu & Szu-Chia Chen & Wen-Hsien Lee & Ying-Chih Chen & Po-Chao Hsu & Wei-Chung Tsai & Chee-Siong Lee & Tsung-Hsien Lin & Chih-Hsing Hung & Chao-Hung Kuo & Ho-Ming Su, 2022. "Prognostic Factors of New-Onset Hypertension in New and Traditional Hypertension Definition in a Large Taiwanese Population Follow-up Study," IJERPH, MDPI, vol. 19(24), pages 1-10, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0315865. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.