Printed from https://ideas.repec.org/a/gam/jijerp/v20y2023i3p1732-d1039277.html

Associations of Preterm Birth with Dental and Gastrointestinal Diseases: Machine Learning Analysis Using National Health Insurance Data

Author

Listed:
  • In-Seok Song

    (Department of Oral and Maxillofacial Surgery, Korea University College of Medicine, Korea University Anam Hospital, Seoul 02841, Republic of Korea; these authors contributed equally to this work.)

  • Eun-Saem Choi

    (Department of Obstetrics and Gynecology, Korea University College of Medicine, Korea University Anam Hospital, Seoul 02841, Republic of Korea; these authors contributed equally to this work.)

  • Eun Sun Kim

    (Department of Gastroenterology, Korea University College of Medicine, Korea University Anam Hospital, Seoul 02841, Republic of Korea)

  • Yujin Hwang

    (Department of Statistics, Korea University College of Political Science & Economics, Korea University Anam Hospital, Seoul 02841, Republic of Korea)

  • Kwang-Sig Lee

    (AI Center, Korea University College of Medicine, Korea University Anam Hospital, Seoul 02841, Republic of Korea)

  • Ki Hoon Ahn

    (Department of Obstetrics and Gynecology, Korea University College of Medicine, Korea University Anam Hospital, Seoul 02841, Republic of Korea)

Abstract

Background: This study uses machine learning with large-scale population data to assess the associations of preterm birth (PTB) with dental and gastrointestinal diseases. Methods: Population-based retrospective cohort data came from Korea National Health Insurance claims for 124,606 primiparous women aged 25–40 who delivered in 2017. The 186 independent variables included demographic/socioeconomic determinants, disease information, and medication history. Machine learning analysis was used to establish a prediction model for PTB. Random forest variable importance was used to identify the major determinants of PTB and to test its associations with dental and gastrointestinal diseases, medication history, and socioeconomic status. Results: The random forest trained on oversampled data achieved an accuracy of 84.03, with areas under the receiver-operating-characteristic curve in the range 84.03–84.04. Based on random forest variable importance with oversampled data, PTB has strong associations with socioeconomic status (0.284), age (0.214), year 2014 gastroesophageal reflux disease (GERD) (0.026), year 2015 GERD (0.026), year 2013 GERD (0.024), progesterone (0.024), year 2012 GERD (0.023), year 2011 GERD (0.021), tricyclic antidepressant (0.020), and year 2016 infertility (0.019). For example, the accuracy of the model decreases by 28.4%, 2.6%, or 1.9%, respectively, if the values of socioeconomic status, year 2014 GERD, or year 2016 infertility are randomly permuted (shuffled). Conclusion: By using machine learning, we established a valid prediction model for PTB. PTB has strong associations with GERD and infertility. Pregnant women need close surveillance for gastrointestinal and obstetric risks at the same time.
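
The permutation reading of variable importance described in the abstract (the drop in accuracy when one variable's values are shuffled) can be illustrated with a minimal sketch. This is not the authors' code or data: the rule-based `predict` function stands in for their random forest, the feature names `ses` and `noise` are hypothetical, and the rows are synthetic. It only shows how such an importance score could be computed for any fitted classifier.

```python
import random


def accuracy(predict, rows, labels):
    """Fraction of rows for which the model's prediction matches the label."""
    correct = sum(1 for row, y in zip(rows, labels) if predict(row) == y)
    return correct / len(labels)


def permutation_importance(predict, rows, labels, feature, n_repeats=20, seed=0):
    """Mean drop in accuracy after shuffling one feature's values across rows."""
    rng = random.Random(seed)
    baseline = accuracy(predict, rows, labels)
    drops = []
    for _ in range(n_repeats):
        column = [row[feature] for row in rows]
        rng.shuffle(column)  # break the link between this feature and the label
        shuffled = [dict(row, **{feature: v}) for row, v in zip(rows, column)]
        drops.append(baseline - accuracy(predict, shuffled, labels))
    return sum(drops) / n_repeats


# Synthetic data (assumption): the outcome depends only on "ses", never on "noise".
data_rng = random.Random(42)
rows = [{"ses": data_rng.random(), "noise": data_rng.random()} for _ in range(500)]
labels = [1 if row["ses"] < 0.5 else 0 for row in rows]


def predict(row):
    """Toy stand-in for a trained classifier (assumption, not a random forest)."""
    return 1 if row["ses"] < 0.5 else 0


imp_ses = permutation_importance(predict, rows, labels, "ses")
imp_noise = permutation_importance(predict, rows, labels, "noise")
print(f"ses importance:   {imp_ses:.3f}")
print(f"noise importance: {imp_noise:.3f}")
```

In this toy setting, shuffling `ses` removes most of the model's accuracy while shuffling `noise` changes nothing. A reported importance of 0.284 is read the same way: accuracy falls by about 28.4 percentage points when that variable is permuted.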

Suggested Citation

  • In-Seok Song & Eun-Saem Choi & Eun Sun Kim & Yujin Hwang & Kwang-Sig Lee & Ki Hoon Ahn, 2023. "Associations of Preterm Birth with Dental and Gastrointestinal Diseases: Machine Learning Analysis Using National Health Insurance Data," IJERPH, MDPI, vol. 20(3), pages 1-9, January.
  • Handle: RePEc:gam:jijerp:v:20:y:2023:i:3:p:1732-:d:1039277

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/20/3/1732/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/20/3/1732/
    Download Restriction: no