
Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach

Author

Listed:
  • Yulia Treister-Goltzman

    (Department of Family Medicine and Siaal Research Center for Family Practice and Primary Care, The Haim Doron Division of Community Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva 84161, Israel
    Clalit Health Services, Southern District, Beer-Sheva 84161, Israel)

Abstract

Objective: Low birth weight is a serious public health problem even in developed countries. The objective of this study was to assess the ability of machine learning to predict low birth weight rates in big cities in the USA at an ecological (population) level. Study design: The study was based on publicly available data from the Big Cities Health Inventory Data Platform. The collected data covered the 35 largest, most urban cities in the United States from 2010 to 2022. A model-agnostic approach was used to assess and visualize the magnitude and direction of the most influential predictors. Results: The models showed excellent performance, with R-squared values of 0.82, 0.81, 0.81, and 0.79 and residual root mean squared error values of 1.06, 0.87, 1.03, and 0.99 for KNN, Best subset, Lasso, and XGBoost, respectively. Notably, the Best subset selection approach achieved a high R-squared and the lowest residual root mean squared error with a subset of only four predictors. Influential predictors that appeared in three or four of the models were the rate of chlamydia infection, racial segregation, prenatal care, percentage of single-parent families, and poverty. Other important predictors were the rate of violent crimes, life expectancy, mental distress, income inequality, hazardous air quality, prevalence of hypertension, percent of foreign-born citizens, and smoking. The study was limited by the unavailability of data on gestational age. Conclusions: The machine learning algorithms showed excellent performance in predicting the low birth weight rate in big cities. Identifying influential predictors can help local and state authorities and health policy decision makers tackle this important health problem more effectively.
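
For readers unfamiliar with the two performance measures quoted in the abstract, the sketch below shows how R-squared and root mean squared error are commonly computed from observed and predicted values. It is a minimal illustration only: the function names and the five data points are hypothetical, and it does not reproduce the study's data, models, or evaluation procedure.

```python
import math


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def rmse(observed, predicted):
    """Root mean squared error, in the same units as the outcome."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


# Hypothetical low birth weight rates (percent) for five city-years.
# These values are invented for illustration and are not from the study.
observed = [8.0, 9.0, 10.0, 11.0, 12.0]
predicted = [8.5, 9.0, 9.5, 11.0, 12.5]

print(f"R-squared: {r_squared(observed, predicted):.3f}")
print(f"RMSE:      {rmse(observed, predicted):.3f}")
```

A higher R-squared means the model explains more of the variation in the outcome, while a lower root mean squared error means predictions sit closer to the observed rates on average. The two measures need not rank models identically, which is consistent with the abstract reporting the highest R-squared for KNN but the lowest error for Best subset.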

Suggested Citation

  • Yulia Treister-Goltzman, 2025. "Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach," IJERPH, MDPI, vol. 22(6), pages 1-10, June.
  • Handle: RePEc:gam:jijerp:v:22:y:2025:i:6:p:934-:d:1678366

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/22/6/934/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/22/6/934/
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nagwan Abdel Samee & Ghada Atteia & Souham Meshoul & Mugahed A. Al-antari & Yasser M. Kadah, 2022. "Deep Learning Cascaded Feature Selection Framework for Breast Cancer Classification: Hybrid CNN with Univariate-Based Approach," Mathematics, MDPI, vol. 10(19), pages 1-27, October.
    2. Zhang, Jianhong & van Witteloostuijn, Arjen & Zhou, Chaohong & Zhou, Shengyang, 2024. "Cross-border acquisition completion by emerging market MNEs revisited: Inductive evidence from a machine learning analysis," Journal of World Business, Elsevier, vol. 59(2).
    3. Nelly Exbrayat & Victor Stephane, 2024. "Does Urbanization Cause Crime? Evidence from Rural-Urban Migration in South Africa," Working Papers halshs-04390026, HAL.
    4. Liu, Yang & Min, Shisheng & Shi, Zhuangbin & He, Mingwei, 2024. "Exploring students' choice of active travel to school in different spatial environments: A case study in a mountain city," Journal of Transport Geography, Elsevier, vol. 115(C).
    5. Wai Khuen Cheng & Khean Thye Bea & Steven Mun Hong Leow & Jireh Yi-Le Chan & Zeng-Wei Hong & Yen-Lin Chen, 2022. "A Review of Sentiment, Semantic and Event-Extraction-Based Approaches in Stock Forecasting," Mathematics, MDPI, vol. 10(14), pages 1-20, July.
    6. Merlino, Luca Paolo & Steinhardt, Max Friedrich & Wren-Lewis, Liam, 2024. "The long run impact of childhood interracial contact on residential segregation," Journal of Public Economics, Elsevier, vol. 239(C).
    7. de Bruin, Sophie & Hoch, Jannis & de Bruijn, Jens & Hermans, Kathleen & Maharjan, Amina & Kummu, Matti & van Vliet, Jasper, 2024. "Scenario projections of South Asian migration patterns amidst environmental and socioeconomic change," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 88, pages 1-12.
    8. Mònica González-Carrasco & Silvana Aciar & Ferran Casas & Xavier Oriol & Ramon Fabregat & Sara Malo, 2024. "A Machine Learning Approach to Well-Being in Late Childhood and Early Adolescence: The Children’s Worlds Data Case," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 175(1), pages 25-47, October.
    9. repec:osf:socarx:hyau2_v1 is not listed on IDEAS
    10. Cheng, Louis T.W. & Cheong, Tsun Se & Wojewodzki, Michal & Chui, David, 2025. "The effect of ESG divergence on the financial performance of Hong Kong-listed firms: An artificial neural network approach," Research in International Business and Finance, Elsevier, vol. 73(PA).
    11. Zerong Wang, 2022. "The Influence of Ethnic Identity on the Academic Performance of Chinese College Students: An Empirical Study Based on the Administrative Data of a University," Review of Economic Assessment, Anser Press, vol. 1(1), pages 1-21, December.
    12. Vu, Cecilia & Arcaya, Mariana C. & Kawachi, Ichiro & Williams, David R., 2023. "Moving to opportunity? Low birth weight outcomes among Southern-born Black mothers during the Great Migration," Social Science & Medicine, Elsevier, vol. 328(C).
    13. Cong Cheng & Jian Dai, 2025. "Predicting Cross-border Merger and Acquisition Completion through CEO Characteristics: A Machine Learning Approach," Management International Review, Springer, vol. 65(1), pages 43-84, February.
    14. You, Geonhwa, 2024. "A comprehensive approach for calibrating anthropogenic effects on atmosphere degradation," Renewable and Sustainable Energy Reviews, Elsevier, vol. 191(C).
    15. Tran Ngoc Mai, 2023. "Renewable Energy, GDP (Gross Domestic Product), FDI (Foreign Direct Investment) and CO2 Emissions in Southeast Asia Countries," International Journal of Energy Economics and Policy, Econjournals, vol. 13(2), pages 284-289, March.
    16. Hoxha, Julian & Çodur, Muhammed Yasin & Mustafaraj, Enea & Kanj, Hassan & El Masri, Ali, 2023. "Prediction of transportation energy demand in Türkiye using stacking ensemble models: Methodology and comparative analysis," Applied Energy, Elsevier, vol. 350(C).
    17. Martin Boďa & David Cole & Mária Murray Svidroňová & Jolana Gubalová, 2022. "Prevailing narratives versus reality of a small and medium town decline in a CEE country," Operational Research, Springer, vol. 22(3), pages 3113-3145, July.
    18. Yang Zhu & Zeqi Zhou & Jingjing Zhou & Xiuping Xu & Xiaogang Wu & Wen Nie, 2025. "An improved 3D Dijkstra algorithm of evacuation route considering tailings dam failure," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 121(3), pages 2483-2505, February.
    19. Blanco-Oliver Antonio & Lara-Rubio Juan & Irimia-Diéguez Ana & Liébana-Cabanillas Francisco, 2024. "Examining user behavior with machine learning for effective mobile peer-to-peer payment adoption," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-30, December.
