IDEAS home Printed from https://ideas.repec.org/a/plo/pmed00/1002674.html
   My bibliography  Save this article

Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study

Author

Listed:
  • Haotian Lin
  • Erping Long
  • Xiaohu Ding
  • Hongxing Diao
  • Zicong Chen
  • Runzhong Liu
  • Jialing Huang
  • Jingheng Cai
  • Shuangjuan Xu
  • Xiayin Zhang
  • Dongni Wang
  • Kexin Chen
  • Tongyong Yu
  • Dongxuan Wu
  • Xutu Zhao
  • Zhenzhen Liu
  • Xiaohang Wu
  • Yuzhen Jiang
  • Xiao Yang
  • Dongmei Cui
  • Wenyan Liu
  • Yingfeng Zheng
  • Lixia Luo
  • Haibo Wang
  • Chi-Chao Chan
  • Ian G Morgan
  • Mingguang He
  • Yizhi Liu

Abstract

Background: Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia is a common cause of vision loss. Current approaches to control myopia progression are effective but have significant side effects. Therefore, identifying those at greatest risk who should undergo targeted therapy is of great clinical importance. The objective of this study was to apply big data and machine learning technology to develop an algorithm that can predict the onset of high myopia, at specific future time points, among Chinese school-aged children. Methods and findings: Real-world clinical refraction data were derived from electronic medical record systems in 8 ophthalmic centres from January 1, 2005, to December 30, 2015. The variables of age, spherical equivalent (SE), and annual progression rate were used to develop an algorithm to predict SE and onset of high myopia (SE ≤ −6.0 dioptres) up to 10 years in the future. Random forest machine learning was used for algorithm training and validation. Electronic medical records from the Zhongshan Ophthalmic Centre (a major tertiary ophthalmic centre in China) were used as the training set. Ten-fold cross-validation and out-of-bag (OOB) methods were applied for internal validation. The remaining 7 independent datasets were used for external validation. Two population-based datasets, which had no participant overlap with the ophthalmic-centre-based datasets, were used for multi-resource validation testing. The main outcomes and measures were the area under the curve (AUC) values for predicting the onset of high myopia over 10 years and the presence of high myopia at 18 years of age. In total, 687,063 multiple visit records (≥3 records) of 129,242 individuals in the ophthalmic-centre-based electronic medical record databases and 17,113 follow-up records of 3,215 participants in population-based cohorts were included in the analysis. Our algorithm accurately predicted the presence of high myopia in internal validation (the AUC ranged from 0.903 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.888 for 8 years), external validation (the AUC ranged from 0.874 to 0.976 for 3 years, 0.847 to 0.921 for 5 years, and 0.802 to 0.886 for 8 years), and multi-resource testing (the AUC ranged from 0.752 to 0.869 for 4 years). With respect to the prediction of high myopia development by 18 years of age, as a surrogate of high myopia in adulthood, the algorithm provided clinically acceptable accuracy over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837). Meanwhile, our algorithm achieved clinically acceptable prediction of the actual refraction values at future time points, which is supported by the regressive performance and calibration curves. Although the algorithm achieved balanced and robust performance, concerns about the compromised quality of real-world clinical data and over-fitting issues should be cautiously considered. Conclusions: To our knowledge, this study, for the first time, used large-scale data collected from electronic health records to demonstrate the contribution of big data and machine learning approaches to improved prediction of myopia prognosis in Chinese school-aged children. This work provides evidence for transforming clinical practice, health policy-making, and precise individualised interventions regarding the practical control of school-aged myopia. Therapies to control myopia progression confer significant side effects and should be targeted to those at highest risk. Here, Yizhi Liu and colleagues report a machine learning algorithm that predicts the progression of myopia, into early adulthood, among Chinese school-aged children.Why was this study done?: What did the researchers do and find?: What do these findings mean?:

Suggested Citation

  • Haotian Lin & Erping Long & Xiaohu Ding & Hongxing Diao & Zicong Chen & Runzhong Liu & Jialing Huang & Jingheng Cai & Shuangjuan Xu & Xiayin Zhang & Dongni Wang & Kexin Chen & Tongyong Yu & Dongxuan W, 2018. "Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study," PLOS Medicine, Public Library of Science, vol. 15(11), pages 1-17, November.
  • Handle: RePEc:plo:pmed00:1002674
    DOI: 10.1371/journal.pmed.1002674
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosmedicine/article?id=10.1371/journal.pmed.1002674
    Download Restriction: no

    File URL: https://journals.plos.org/plosmedicine/article/file?id=10.1371/journal.pmed.1002674&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pmed.1002674?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pmed00:1002674. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosmedicine (email available below). General contact details of provider: https://journals.plos.org/plosmedicine/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.