Printed from https://ideas.repec.org/a/gam/jijerp/v22y2025i6p934-d1678366.html

Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach

Author

  • Yulia Treister-Goltzman

    (Department of Family Medicine and Siaal Research Center for Family Practice and Primary Care, The Haim Doron Division of Community Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva 84161, Israel;
    Clalit Health Services, Southern District, Beer-Sheva 84161, Israel)

Abstract

Objective: Low birth weight is a serious public health problem, even in developed countries. The objective of this study was to assess the ability of machine learning to predict low birth weight rates in big cities in the USA at the ecological (population) level.

Study design: The study was based on publicly available data from the Big Cities Health Inventory Data Platform. The data covered the 35 largest, most urban cities in the United States from 2010 to 2022. A model-agnostic approach was used to assess and visualize the magnitude and direction of the most influential predictors.

Results: The models showed excellent performance, with R-squared values of 0.82, 0.81, 0.81, and 0.79 and residual root mean squared error values of 1.06, 0.87, 1.03, and 0.99 for KNN, Best subset, Lasso, and XGBoost, respectively. Notably, the Best subset selection approach achieved a high R-squared and the lowest residual root mean squared error with a subset of only four predictors. Influential predictors that appeared in three or four of the models were the rate of chlamydia infection, racial segregation, prenatal care, percentage of single-parent families, and poverty. Other important predictors were the rate of violent crimes, life expectancy, mental distress, income inequality, hazardous air quality, prevalence of hypertension, percentage of foreign-born citizens, and smoking. The study was limited by the unavailability of data on gestational age.

Conclusions: The machine learning algorithms showed excellent performance in predicting the low birth weight rate in big cities. Identifying influential predictors can help local and state authorities and health policy decision makers tackle this important health problem more effectively.
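The abstract compares models by R-squared and root mean squared error. The following is a minimal sketch of how these two metrics are commonly computed from observed and predicted values. It is an illustration only, not the author's code: the function names and the numbers are hypothetical, and the paper may define its "residual root mean squared error" or its R-squared slightly differently (for example, as a squared correlation).

```python
import math


def r_squared(observed, predicted):
    """Coefficient of determination, computed as 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def rmse(observed, predicted):
    """Root mean squared error of the predictions."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical low birth weight rates (percent) for four city-years.
# These values are made up for illustration and do not come from the study.
observed = [8.0, 10.0, 12.0, 14.0]
predicted = [9.0, 10.0, 11.0, 14.0]

print(f"R-squared: {r_squared(observed, predicted):.2f}")
print(f"RMSE:      {rmse(observed, predicted):.2f}")
```

A higher R-squared and a lower error both indicate a better fit, which is why the abstract singles out the Best subset model: its R-squared is close to the highest reported value while its error is the lowest.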

Suggested Citation

  • Yulia Treister-Goltzman, 2025. "Predicting Low Birth Weight in Big Cities in the United States Using a Machine Learning Approach," IJERPH, MDPI, vol. 22(6), pages 1-10, June.
  • Handle: RePEc:gam:jijerp:v:22:y:2025:i:6:p:934-:d:1678366

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/22/6/934/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/22/6/934/
    Download Restriction: no

    References listed on IDEAS

    1. Jireh Yi-Le Chan & Steven Mun Hong Leow & Khean Thye Bea & Wai Khuen Cheng & Seuk Wai Phoong & Zeng-Wei Hong & Yen-Lin Chen, 2022. "Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review," Mathematics, MDPI, vol. 10(8), pages 1-17, April.
    2. Niemesh, Gregory T. & Shester, Katharine L., 2020. "Racial residential segregation and black low birth weight, 1970–2010," Regional Science and Urban Economics, Elsevier, vol. 83(C).
    3. Grossman, Daniel & Khalil, Umair, 2022. "Neighborhood crime and infant health," Journal of Urban Economics, Elsevier, vol. 130(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Exbrayat, Nelly & Stephane, Victor, 2025. "Does urbanization cause crime? Evidence from rural–urban migration in South Africa," Journal of Urban Economics, Elsevier, vol. 149(C).
    2. Nagwan Abdel Samee & Ghada Atteia & Souham Meshoul & Mugahed A. Al-antari & Yasser M. Kadah, 2022. "Deep Learning Cascaded Feature Selection Framework for Breast Cancer Classification: Hybrid CNN with Univariate-Based Approach," Mathematics, MDPI, vol. 10(19), pages 1-27, October.
    3. Du, Yuan & Sun, Lu & Cui, Wei & Wang, Hongxin, 2025. "Ethnic green culture in leadership and corporate green investment: Evidence from China," Global Finance Journal, Elsevier, vol. 67(C).
    4. Zhang, Jianhong & van Witteloostuijn, Arjen & Zhou, Chaohong & Zhou, Shengyang, 2024. "Cross-border acquisition completion by emerging market MNEs revisited: Inductive evidence from a machine learning analysis," Journal of World Business, Elsevier, vol. 59(2).
    5. Nelly Exbrayat & Victor Stephane, 2024. "Does Urbanization Cause Crime? Evidence from Rural-Urban Migration in South Africa," Working Papers halshs-04390026, HAL.
    6. Liu, Yang & Min, Shisheng & Shi, Zhuangbin & He, Mingwei, 2024. "Exploring students' choice of active travel to school in different spatial environments: A case study in a mountain city," Journal of Transport Geography, Elsevier, vol. 115(C).
    7. Wai Khuen Cheng & Khean Thye Bea & Steven Mun Hong Leow & Jireh Yi-Le Chan & Zeng-Wei Hong & Yen-Lin Chen, 2022. "A Review of Sentiment, Semantic and Event-Extraction-Based Approaches in Stock Forecasting," Mathematics, MDPI, vol. 10(14), pages 1-20, July.
    8. Md Samirul Islam & Md Iftakhayrul Islam & Abdul Quddus Mozumder & Md Tamjidul Haq Khan & Niropam Das & Nur Mohammad, 2025. "A Conceptual Framework for Sustainable AI-ERP Integration in Dark Factories: Synthesising TOE, TAM, and IS Success Models for Autonomous Industrial Environments," Sustainability, MDPI, vol. 17(20), pages 1-23, October.
    9. de Bruin, Sophie & Hoch, Jannis & de Bruijn, Jens & Hermans, Kathleen & Maharjan, Amina & Kummu, Matti & van Vliet, Jasper, 2024. "Scenario projections of South Asian migration patterns amidst environmental and socioeconomic change," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 88, pages 1-12.
    10. Mònica González-Carrasco & Silvana Aciar & Ferran Casas & Xavier Oriol & Ramon Fabregat & Sara Malo, 2024. "A Machine Learning Approach to Well-Being in Late Childhood and Early Adolescence: The Children’s Worlds Data Case," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 175(1), pages 25-47, October.
    11. repec:osf:socarx:hyau2_v1 is not listed on IDEAS
    12. Cheng, Louis T.W. & Cheong, Tsun Se & Wojewodzki, Michal & Chui, David, 2025. "The effect of ESG divergence on the financial performance of Hong Kong-listed firms: An artificial neural network approach," Research in International Business and Finance, Elsevier, vol. 73(PA).
    13. Merlino, Luca Paolo & Steinhardt, Max Friedrich & Wren-Lewis, Liam, 2024. "The long run impact of childhood interracial contact on residential segregation," Journal of Public Economics, Elsevier, vol. 239(C).
    14. Zerong Wang, 2022. "The Influence of Ethnic Identity on the Academic Performance of Chinese College Students: An Empirical Study Based on the Administrative Data of a University," Review of Economic Assessment, Anser Press, vol. 1(1), pages 1-21, December.
    15. Xingyao Wang & Ziyi Peng & Xue Yang, 2025. "Multimodal Data-Driven Hourly Dynamic Assessment of Walkability on Urban Streets and Exploration of Regulatory Mechanisms for Diurnal Changes: A Case Study of Wuhan City," Land, MDPI, vol. 14(8), pages 1-30, July.
    16. Cao, Yangfan & Choo, Wei Chong & Matemilola, Bolaji Tunde, 2025. "Value-at-risk forecasting- based on textual information and a hybrid deep learning-based approach," International Review of Economics & Finance, Elsevier, vol. 103(C).
    17. Vu, Cecilia & Arcaya, Mariana C. & Kawachi, Ichiro & Williams, David R., 2023. "Moving to opportunity? Low birth weight outcomes among Southern-born Black mothers during the Great Migration," Social Science & Medicine, Elsevier, vol. 328(C).
    18. Cong Cheng & Jian Dai, 2025. "Predicting Cross-border Merger and Acquisition Completion through CEO Characteristics: A Machine Learning Approach," Management International Review, Springer, vol. 65(1), pages 43-84, February.
    19. You, Geonhwa, 2024. "A comprehensive approach for calibrating anthropogenic effects on atmosphere degradation," Renewable and Sustainable Energy Reviews, Elsevier, vol. 191(C).
    20. Tran Ngoc Mai, 2023. "Renewable Energy, GDP (Gross Domestic Product), FDI (Foreign Direct Investment) and CO2 Emissions in Southeast Asia Countries," International Journal of Energy Economics and Policy, Econjournals, vol. 13(2), pages 284-289, March.
    21. Chiara Berardi & Heidi Wechtler & Madeleine Hinwood & Frederik Schut, 2025. "Comparing the Evolving Dynamics of the Mandatory-Voluntary Financing Mix in OECD Countries: A Composite Measure," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 179(2), pages 593-616, September.



    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.