IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0335442.html

Predicting hypertension and identifying most important factors among married women in Bangladesh using machine learning approach

Author

Listed:
  • Novel Chandra Das
  • Probir Kumar Ghosh
  • Md Alamgir Hossain
  • Uddip Acharjee Shuvo
  • Nipa Rani Talukder
  • Fatema Khatun
  • Mohammad Ziaul Islam Chowdhury

Abstract

Introduction: Hypertension is a leading contributor to maternal and cardiometabolic morbidity in Bangladesh. We developed and interpreted machine-learning (ML) models to predict hypertension and rank associated factors among married women with the goal of informing targeted screening and policy in low-resource settings. Methods: We analyzed 4,253 married women from the nationally representative BDHS 2017–18 survey (hypertension prevalence: 23.1%). Twelve ML algorithms were trained under six class-balancing strategies with hyperparameters tuned via random search. Validation used a hold-out test set (80/20) and repeated stratified k-fold cross-validation; bootstrap confidence intervals were estimated for the selected model. Model performance was compared with parametric and non-parametric tests. To interpret results, SHAP was used to rank the top 20 predictors and visualize feature effects. Models quantify associations rather than causation. Results: The Extra Trees classifier with SMOTE+ENN achieved the best discrimination (F1 = 0.94; AUC-PR = 0.95; ROC-AUC = 0.95). Compared with the original imbalanced training, minority-class detection improved substantially (Extra Trees F1 increased from 0.08 to 0.94; recall from 0.04 to 0.95) while accuracy and ROC-AUC remained relatively stable across samplers. Statistical testing favored SMOTE+ENN for recall, F1, G-mean and AUC-PR. SHAP identified age, parity, recent births, contraceptive use, spousal education and BMI as key predictors. Younger age (

Suggested Citation

  • Novel Chandra Das & Probir Kumar Ghosh & Md Alamgir Hossain & Uddip Acharjee Shuvo & Nipa Rani Talukder & Fatema Khatun & Mohammad Ziaul Islam Chowdhury, 2025. "Predicting hypertension and identifying most important factors among married women in Bangladesh using machine learning approach," PLOS ONE, Public Library of Science, vol. 20(10), pages 1-34, October.
  • Handle: RePEc:plo:pone00:0335442
    DOI: 10.1371/journal.pone.0335442
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0335442
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0335442&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0335442?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Md Ismail Tareque & Atsushi Koshio & Andrew D Tiedt & Toshihiko Hasegawa, 2015. "Are the Rates of Hypertension and Diabetes Higher in People from Lower Socioeconomic Status in Bangladesh? Results from a Nationally Representative Survey," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-17, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Karar Zunaid Ahsan & Afrin Iqbal & Kanta Jamil & M Moinuddin Haider & Shusmita Hossain Khan & Nitai Chakraborty & Peter Kim Streatfield, 2022. "Socioeconomic disparities in diabetes prevalence and management among the adult population in Bangladesh," PLOS ONE, Public Library of Science, vol. 17(12), pages 1-19, December.
    2. Md Shariful Islam & Md Ismail Tareque & Md Nazrul Islam Mondal & Ahbab Mohammad Fazle Rabbi & Hafiz T A Khan & Sharifa Begum, 2017. "Urban-rural differences in disability-free life expectancy in Bangladesh using the 2010 HIES data," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-13, July.
    3. Siying Li & Wenye Fan & Boya Zhu & Chao Ma & Xiaodong Tan & Yaohua Gu, 2022. "Frailty Risk Prediction Model among Older Adults: A Chinese Nation-Wide Cross-Sectional Study," IJERPH, MDPI, vol. 19(14), pages 1-13, July.
    4. Anubha Agarwal & Devraj Jindal & Vamadevan S Ajay & Dimple Kondal & Siddhartha Mandal & Shreeparna Ghosh & Mumtaj Ali & Kavita Singh & Mark D Huffman & Nikhil Tandon & Dorairaj Prabhakaran, 2019. "Association between socioeconomic position and cardiovascular disease risk factors in rural north India: The Solan Surveillance Study," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-16, July.
    5. Carlos Rojas-Roque & Akram Hernández-Vásquez & Diego Azañedo & Guido Bendezu-Quispe, 2022. "Socioeconomic Inequalities in the Prevalence of Diabetes in Argentina: A Repeated Cross-Sectional Study in Urban Women and Men," IJERPH, MDPI, vol. 19(15), pages 1-12, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0335442. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.