Author
Listed:
- Shamsuz Zoha
- Shahin Alam
- Isteaq Kabir Sifat
- Nourin Sultana
- Md Kaderi Kibria
Abstract
Cesarean section (C-section) rates have been rising globally, posing potential health risks for mothers and infants. Understanding the factors that contribute to C-section delivery and leveraging machine learning (ML) techniques for predictive modeling can support targeted interventions and informed policy decisions. This study aimed to identify the determinants of C-section delivery and develop an ML-based predictive model using data from the Bangladesh Demographic and Health Survey (BDHS) 2022. A total of 2,490 complete records of ever-married women aged 15–49 years were analyzed, where the delivery mode was categorized as vaginal or C-section. Three feature selection techniques including Recursive Feature Elimination (RFE), Boruta-based selection (BFS), and Random Forest (RF) were used to identify key risk factors. Six ML algorithms, including Logistic Regression (LR), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF), and Extreme Gradient Boosting (XGB) were employed to predict C-section. Model performance was evaluated using accuracy, precision, recall, F1-score, AUC, and ROC analysis. SHAP values were used to interpret the influence of individual features.The prevalence of C-section deliveries was 45.6%, with an average maternal age of 25.7 years and mean age at first childbirth of 19.3 years. Ten significant determinants were identified, including place of delivery, baby weight, maternal BMI, birth interval, age at first birth, partner’s education, maternal age, wealth status, ANC visits, and maternal education. The RF model achieved the highest performance with an accuracy of 81.79%, and an AUC of 0.871. SHAP analysis highlighted that place of delivery, baby weight, maternal BMI, and birth interval were the most influential predictors. These findings suggest that socio-demographic and healthcare-related factors strongly influence C-section delivery. Machine learning models particularly the RF can effectively identify women at high risk, supporting strategies to reduce unnecessary C-sections and improve maternal healthcare planning in Bangladesh.
Suggested Citation
Shamsuz Zoha & Shahin Alam & Isteaq Kabir Sifat & Nourin Sultana & Md Kaderi Kibria, 2025.
"Identifying determinants and predicting cesarean section delivery among Bangladeshi women using machine learning: Insight from BDHS 2022 Data,"
PLOS Global Public Health, Public Library of Science, vol. 5(11), pages 1-18, November.
Handle:
RePEc:plo:pgph00:0005494
DOI: 10.1371/journal.pgph.0005494
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgph00:0005494. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: globalpubhealth (email available below). General contact details of provider: https://journals.plos.org/globalpublichealth .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.