IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0343796.html

Applying machine learning to predict stunting in children under 5 years old based on water, sanitation and hygiene behaviors and infrastructure

Author

Listed:
  • Sanaya Sinharoy
  • Heather Reese
  • Thomas Clasen
  • Sheela S Sinharoy

Abstract

Objective: Child stunting continues to pose a substantial global health challenge, requiring multifaceted strategies that combine conventional epidemiological approaches with advanced analytic methods. The aim of this study was to determine the most effective machine learning model for predicting stunting based on water, sanitation, and hygiene behaviors and infrastructure, with the goal of identifying high-risk children who would benefit most from targeted interventions. Methods: This study was a secondary analysis of data from a matched cohort study assessing the effectiveness of combined on-premise piped water and improved sanitation for improved health outcomes in rural Odisha, India. Data for the parent study were collected from 2,398 households with a child under five years of age across 90 villages, and complete data were available for 1,196 children. Feature engineering techniques were employed to identify the most relevant predictors and utilized structural equation modeling, forward selection, backward elimination, and least absolute shrinkage and selection operator techniques. Five machine learning algorithms commonly used for binary classification tasks were compared: logistic regression, classification tree, support vector machine, neural network, and extreme gradient boosting. Results: Among 1,196 children analyzed, the extreme gradient boosting model with forward selection feature engineering best predicted stunting based on water, sanitation, and hygiene (WaSH) factors. It correctly identified 81% of stunted children and 92% of non-stunted children, with an overall accuracy of 88%. The model’s area under the receiver operating characteristic curve (AUROC) was 0.959 (95% CI: 0.949–0.968), indicating that WaSH factors strongly predict child stunting when analyzed using this advanced machine learning technique. Four WaSH factors were identified as having the strongest power to predict stunting in our sample: improved sanitation coverage, presence of a handwashing station, piped water coverage, and availability of preferred drinking water source. Conclusions: The results demonstrate the efficacy of machine learning algorithms, especially extreme gradient boosting to potentially inform targeted WaSH interventions for reducing childhood stunting in resource-limited settings. However, these findings require external validation in other populations, and the complete-case analysis approach (excluding 35% of children with missing data) may limit generalizability to settings with less systematic data collection.

Suggested Citation

  • Sanaya Sinharoy & Heather Reese & Thomas Clasen & Sheela S Sinharoy, 2026. "Applying machine learning to predict stunting in children under 5 years old based on water, sanitation and hygiene behaviors and infrastructure," PLOS ONE, Public Library of Science, vol. 21(3), pages 1-19, March.
  • Handle: RePEc:plo:pone00:0343796
    DOI: 10.1371/journal.pone.0343796
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0343796
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0343796&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0343796?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Perkins, Jessica M. & Kim, Rockli & Krishna, Aditi & McGovern, Mark & Aguayo, Victor M. & Subramanian, S.V., 2017. "Understanding the association between stunting and child development in low- and middle-income countries: Next steps for research and intervention," Social Science & Medicine, Elsevier, vol. 193(C), pages 101-109.
    2. Amy J. Pickering & Habiba Djebbari & Carolina Lopez & Massa Coulibaly & Maria Laura Alzua, 2015. "Effect of a community-led sanitation intervention on child diarrhoea and child growth in rural Mali: a cluster-randomised controlled trial," Post-Print hal-01456117, HAL.
    3. Galasso, Emanuela & Wagstaff, Adam, 2019. "The aggregate income losses from childhood stunting and the returns to a nutrition intervention aimed at reducing stunting," Economics & Human Biology, Elsevier, vol. 34(C), pages 225-238.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Holla,Alaka & Bendini,Maria Magdalena & Dinarte Diaz,Lelys Ileana & Trako,Iva, 2021. "Is Investment in Preprimary Education Too Low ? Lessons from (Quasi) ExperimentalEvidence across Countries," Policy Research Working Paper Series 9723, The World Bank.
    2. María Laura Alzúa & Habiba Djebbari & Amy J. Pickering, 2020. "A Community-Based Program Promotes Sanitation," Economic Development and Cultural Change, University of Chicago Press, vol. 68(2), pages 357-390.
    3. Nguyen, Trung Thanh & Nguyen, Thanh-Tung & Do, Manh Hung & Rahut, Dil & Nguyen, Duy Linh, 2025. "Remittances, sanitation and child malnutrition in middle-income countries: A case study from rural Northeast Thailand and Central Vietnam," World Development, Elsevier, vol. 190(C).
    4. Kim, Rockli & Rajpal, Sunil & Joe, William & Corsi, Daniel J. & Sankar, Rajan & Kumar, Alok & Subramanian, S.V., 2019. "Assessing associational strength of 23 correlates of child anthropometric failure: An econometric analysis of the 2015-2016 National Family Health Survey, India," Social Science & Medicine, Elsevier, vol. 238(C), pages 1-1.
    5. Job Wasonga & Kazuchiyo Miyamichi & Mami Hitachi & Rie Ozaki & Mohamed Karama & Kenji Hirayama & Satoshi Kaneko, 2023. "Effects of Community-Led Total Sanitation (CLTS) Boosting and Household Factors on Latrine Ownership in Siaya County, Kenya," IJERPH, MDPI, vol. 20(18), pages 1-12, September.
    6. Augsburg, Britta & Caeyers, Bet & Giunti, Sara & Malde, Bansi & Smets, Susanna, 2023. "Labeled loans and human capital investments," Journal of Development Economics, Elsevier, vol. 162(C).
    7. Augsburg, Britta & Malde, Bansi & Olorenshaw, Harriet & Wahhaj, Zaki, 2023. "To invest or not to invest in sanitation: The role of intra-household gender differences in perceptions and bargaining power," Journal of Development Economics, Elsevier, vol. 162(C).
    8. Oliver Cumming & Benjamin F. Arnold & Radu Ban & Thomas Clasen & Joanna Esteves Mills & Matthew C. Freeman & Bruce Gordon & Raymond Guiteras & Guy Howard & Paul R. Hunter & Richard B. Johnston & Amy J, "undated". "The Implications of Three Major New Trials for the Effect of Water, Sanitation, and Hygiene on Childhood Diarrhea and Stunting: A Consensus Statement," Mathematica Policy Research Reports a98f913f56cd44caba883fff2, Mathematica Policy Research.
    9. Schneider, Eric & Ogasawara, Kota & Cole, Tim J., 2020. "The Effect of the Second World War on the Growth Pattern of Height in Japanese Children: Catch-up Growth, Critical Windows and," CEPR Discussion Papers 14808, C.E.P.R. Discussion Papers.
    10. Khalid Abu-Ismail & Verena Gantner & Paul Makdissi & Myra Yazbeck, 2020. "Socioeconomic inequalities in child malnutrition in Egypt," METRON, Springer;Sapienza Università di Roma, vol. 78(2), pages 175-191, August.
    11. Harter, Miriam & Inauen, Jennifer & Mosler, Hans-Joachim, 2020. "How does Community-Led Total Sanitation (CLTS) promote latrine construction, and can it be improved? A cluster-randomized controlled trial in Ghana," Social Science & Medicine, Elsevier, vol. 245(C).
    12. Cameron, Lisa & Chase, Claire & Haque, Sabrina & Joseph, George & Pinto, Rebekah & Wang, Qiao, 2021. "Childhood stunting and cognitive effects of water and sanitation in Indonesia," Economics & Human Biology, Elsevier, vol. 40(C).
    13. Orgill-Meyer, Jennifer & Pattanayak, Subhrendu K., 2020. "Improved sanitation increases long-term cognitive test scores," World Development, Elsevier, vol. 132(C).
    14. Augsburg, Britta & Baquero, Juan P. & Gautam, Sanghmitra & Rodriguez-Lesmes, Paul, 2023. "Sanitation and marriage markets in India: Evidence from the Total Sanitation Campaign," Journal of Development Economics, Elsevier, vol. 163(C).
    15. Abdullah, Alhassan & Emery, Clifton & Xu, Yanfeng & Mensah, Felix, 2025. "Associations between child neglect, informal interventions in food neglect, and child stunting: Evidence from the Ghana families study," Children and Youth Services Review, Elsevier, vol. 172(C).
    16. Emily C Moody & Elena Colicino & Robert O Wright & Ezekiel Mupere & Ericka G Jaramillo & Chitra Amarasiriwardena & Sarah E Cusick, 2020. "Environmental exposure to metal mixtures and linear growth in healthy Ugandan children," PLOS ONE, Public Library of Science, vol. 15(5), pages 1-13, May.
    17. Adolfo Meisel-Roca & Angela Granger, 2021. "The Height of Children and Adolescents in Colombia. A Review of More than Sixty Years of Anthropometric Studies, 1957–2020," IJERPH, MDPI, vol. 18(16), pages 1-22, August.
    18. Deb, Saubhik & Joseph, George & Andrés, Luis Alberto & Zabludovsky, Jonathan Grabinsky, 2024. "Is the glass half full or half empty? Examining the impact of Swatch Bharat interventions on sanitation and hygiene in rural Punjab, India," Journal of Development Economics, Elsevier, vol. 170(C).
    19. Bakhtiar, M. Mehrab & Guiteras, Raymond P. & Levinsohn, James & Mobarak, Ahmed Mushfiq, 2023. "Social and financial incentives for overcoming a collective action problem," Journal of Development Economics, Elsevier, vol. 162(C).
    20. Augsburg,Britta & Caeyers,Bet & Giunti,Sara & Malde,Bansi Khimji & Smets,Susanna, 2019. "Labelled Loans, Credit Constraints and Sanitation Investments," Policy Research Working Paper Series 8845, The World Bank.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0343796. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.