IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v22y2025i6p934-d1678366.html
   My bibliography  Save this article

Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach

Author

Listed:
  • Yulia Treister-Goltzman

    (Department of Family Medicine and Siaal Research Center for Family Practice and Primary Care, The Haim Doron Division of Community Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva 84161, Israel
    Clalit Health Services, Southern District, Beer-Sheva 84161, Israel)

Abstract

Objective: Low birth weight is a serious public health problem even in developed countries. The objective of this study was to assess the ability of machine learning to predict low birth weight rates in big cities in the USA on an ecological/population level. Study design: The study was based on publicly available data from the Big Cities Health Inventory Data Platform. The collected data related to the 35 largest, most urban cities in the United States from 2010 to 2022. The model-agnostic approach was used to assess and visualize the magnitude and direction of the most influential predictors. Results: The models showed excellent performance with R-squared values of 0.82, 0.81, 0.81, and 0.79, and residual root mean squared error values of 1.06, 0.87, 1.03, 0.99 for KNN, Best subset, Lasso, and XGBoost, respectively. It is noteworthy that the Best subset selection approach had a high RSq and the lowest residual root mean squared error, with only a four-predictor subset. Influential predictors that appeared in three/four models were rate of chlamydia infection, racial segregation, prenatal care, percentage of single-parent families, and poverty. Other important predictors were the rate of violent crimes, life expectancy, mental distress, income inequality, hazardous air quality, prevalence of hypertension, percent of foreign-born citizens, and smoking. This study was limited by the unavailability of data on gestational age. Conclusions: The machine learning algorithms showed excellent performance for the prediction of low birth weight rate in big cities. The identification of influential predictors can help local and state authorities and health policy decision makers to more effectively tackle this important health problem.

Suggested Citation

  • Yulia Treister-Goltzman, 2025. "Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach," IJERPH, MDPI, vol. 22(6), pages 1-10, June.
  • Handle: RePEc:gam:jijerp:v:22:y:2025:i:6:p:934-:d:1678366
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/22/6/934/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/22/6/934/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:22:y:2025:i:6:p:934-:d:1678366. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.