IDEAS home Printed from https://ideas.repec.org/a/plo/pgph00/0005187.html
   My bibliography  Save this article

Machine learning based prediction of low birth weight and its associated risk factors: Insights from the Bangladesh Demographic and Health Survey 2022

Author

Listed:
  • Nourin Sultana
  • Zeba Afia
  • Isteaq Kabir Sifat
  • Shamsuz Zoha
  • Tajin Ahmed Jisa
  • Md Kaderi Kibria

Abstract

Low birth weight (LBW) is a major public health concern particularly in low and middle-income countries as it contributes to increased infant mortality and long-term health complications. This study applies and evaluates machine learning (ML) algorithms to predict LBW and identify its key risk factors in Bangladesh. Data were collected from 3,192 complete records of ever-married women aged 15–49 years from the Bangladesh Demographic and Health Survey, 2022. Risk factors for LBW were identified by four feature selection techniques including Boruta-based selection (BFS), LASSO regression, Elastic Net and Random Forest (RF). Six ML algorithms, including Logistic Regression (LR), RF, Decision Tree (DT), Artificial Neural Networks (ANN), Extreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM) were performed to predict LBW. Model performance was evaluated using accuracy, precision, recall, F1-score, AUC, and ROC analysis. SHAP values were utilized to examine the influence of individual features on the model’s prediction. The prevalence of LBW in Bangladesh was 27.8%. Twelve features were identified and the XGB model outperformed the other models by achieving the highest performance in predicting LBW with an accuracy of 80% and area under the curve of 0.761 in holdout (90:10) cross-validation. SHAP analysis revealed that ‘pregnancy duration’ and ‘division’ were the strongest predictors of LBW risk followed by ‘marriage to first birth interval’ ‘ANC visits’ ‘C-section’ and ‘place of delivery’. These findings demonstrate that XGB can serve as an effective tool for predicting LBW and identifying important risk factors that may guide targeted interventions. The insights generated from this study can support public health strategies aimed at reducing LBW prevalence in Bangladesh.

Suggested Citation

  • Nourin Sultana & Zeba Afia & Isteaq Kabir Sifat & Shamsuz Zoha & Tajin Ahmed Jisa & Md Kaderi Kibria, 2025. "Machine learning based prediction of low birth weight and its associated risk factors: Insights from the Bangladesh Demographic and Health Survey 2022," PLOS Global Public Health, Public Library of Science, vol. 5(9), pages 1-17, September.
  • Handle: RePEc:plo:pgph00:0005187
    DOI: 10.1371/journal.pgph.0005187
    as

    Download full text from publisher

    File URL: https://journals.plos.org/globalpublichealth/article?id=10.1371/journal.pgph.0005187
    Download Restriction: no

    File URL: https://journals.plos.org/globalpublichealth/article/file?id=10.1371/journal.pgph.0005187&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgph.0005187?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgph00:0005187. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: globalpubhealth (email available below). General contact details of provider: https://journals.plos.org/globalpublichealth .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.