IDEAS home Printed from https://ideas.repec.org/a/plo/pdig00/0000354.html
   My bibliography  Save this article

Artificial intelligence with temporal features outperforms machine learning in predicting diabetes

Author

Listed:
  • Iqra Naveed
  • Muhammad Farhat Kaleem
  • Karim Keshavjee
  • Aziz Guergachi

Abstract

Diabetes mellitus type 2 is increasingly being called a modern preventable pandemic, as even with excellent available treatments, the rate of complications of diabetes is rapidly increasing. Predicting diabetes and identifying it in its early stages could make it easier to prevent, allowing enough time to implement therapies before it gets out of control. Leveraging longitudinal electronic medical record (EMR) data with deep learning has great potential for diabetes prediction. This paper examines the predictive competency of deep learning models in contrast to state-of-the-art machine learning models to incorporate the time dimension of risk. The proposed research investigates a variety of deep learning models and features for predicting diabetes. Model performance was appraised and compared in relation to predominant features, risk factors, training data density and visit history. The framework was implemented on the longitudinal EMR records of over 19K patients extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN). Empirical findings demonstrate that deep learning models consistently outperform other state-of-the-art competitors with prediction accuracy of above 91%, without overfitting. Fasting blood sugar, hemoglobin A1c and body mass index are the key predictors of future onset of diabetes. Overweight, middle aged patients and patients with hypertension are more vulnerable to developing diabetes, consistent with what is already known. Model performance improves as training data density or the visit history of a patient increases. This study confirms the ability of the LSTM deep learning model to incorporate the time dimension of risk in its predictive capabilities.Author summary: Diabetes is a growing problem around the world and yet it is preventable. A small percentage of people are at higher risk of developing diabetes. Detecting those at highest risk early and offering them early treatment could go a long way to slowing down the growth of diabetes and reverse the trend of severe complications of diabetes. One of the barriers to early detection is our inability to take into account the risk that accumulates over time. Someone who has had elevated blood sugar for 5 years has more risk than someone who has only had it for 1 year, yet all current prediction models only take into account the blood sugar and not the time element. This paper reports on our research with artificial intelligence models that can take into account the time element of risk.

Suggested Citation

  • Iqra Naveed & Muhammad Farhat Kaleem & Karim Keshavjee & Aziz Guergachi, 2023. "Artificial intelligence with temporal features outperforms machine learning in predicting diabetes," PLOS Digital Health, Public Library of Science, vol. 2(10), pages 1-17, October.
  • Handle: RePEc:plo:pdig00:0000354
    DOI: 10.1371/journal.pdig.0000354
    as

    Download full text from publisher

    File URL: https://journals.plos.org/digitalhealth/article?id=10.1371/journal.pdig.0000354
    Download Restriction: no

    File URL: https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0000354&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pdig.0000354?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:plo:pone00:0146917 is not listed on IDEAS
    2. Saqib E Awan & Mohammed Bennamoun & Ferdous Sohel & Frank M Sanfilippo & Benjamin J Chow & Girish Dwivedi, 2019. "Feature selection and transformation by machine learning reduce variable numbers and improve prediction for heart failure readmission or death," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-13, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Saurabh Shukla & Mohd Fadzil Hassan & Muhammad Khalid Khan & Low Tang Jung & Azlan Awang, 2019. "An analytical model to minimize the latency in healthcare internet-of-things in fog computing environment," PLOS ONE, Public Library of Science, vol. 14(11), pages 1-31, November.
    2. Shan, Shan, 2024. "Unveiling Socioeconomic Factors Shaping Global Food Prices and Security: A Machine Learning Approach," Asian Journal of Agriculture and Development, Southeast Asian Regional Center for Graduate Study and Research in Agriculture (SEARCA), vol. 21(2), December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pdig00:0000354. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: digitalhealth (email available below). General contact details of provider: https://journals.plos.org/digitalhealth .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.