Author
Listed:
- Suman Biswas
- Md Mahamudul Islam
- Nusrat Islam
- Md Abdur Rahim Mia
Abstract
Overweight/obesity has become a critical global health issue, as these conditions are strongly associated with elevated risk of diabetes, stroke, cardiovascular disorders, and certain types of cancer. In recent decades, Bangladesh has faced a notable rise in overweight/obesity prevalence—women are more prone to obesity than men. This study presents a comprehensive strategy for identifying risk factors and predicting overweight and obesity through machine learning (ML) classifiers among ever-married Bangladeshi women aged 15–49 years. Data from the 2017–2018 BDHS, a nationally representative survey, were examined. The data were pre-processed and subsequently balanced using the synthetic minority over-sampling technique and edited nearest neighbors (SMOTE-ENN) approach. Various feature identification techniques, including Chi-Square, LASSO, and Sequential Forward Selection, were employed to determine the key risk features. Later, permutation feature importance and SHAP analysis were employed to assess the influence of these risk factors on overweight/obesity. The classification of overweight and obesity was conducted using seven machine learning models: Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), K-nearest Neighbors (KNN), eXtreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Multilayer Perceptron (MLP). Among the evaluated models, SVM performed best, reaching 95.79% accuracy and 97.32% precision when combined with SMOTE-ENN and hyper-parameter tuning. The study found that key factors contributing to being overweight/obese include age, division, type of residence, educational levels of both the respondent and her partner, number of children, frequency of television viewing, and wealth status; where wealth status, age, and frequency of watching television have strong influences. Therefore, integrating the balancing algorithm with the embedded feature selection strategy was effective in classifying overweight/obese women and could enhance decision-making for preventive measures in public health through timely predictions of overweight/obesity.
Suggested Citation
Suman Biswas & Md Mahamudul Islam & Nusrat Islam & Md Abdur Rahim Mia, 2026.
"National data meets AI: Machine learning for predicting overweight/obesity among ever-married Bangladeshi women,"
PLOS ONE, Public Library of Science, vol. 21(2), pages 1-24, February.
Handle:
RePEc:plo:pone00:0341821
DOI: 10.1371/journal.pone.0341821
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0341821. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.