IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0078927.html
   My bibliography  Save this article

Modeling Disease Severity in Multiple Sclerosis Using Electronic Health Records

Author

Listed:
  • Zongqi Xia
  • Elizabeth Secor
  • Lori B Chibnik
  • Riley M Bove
  • Suchun Cheng
  • Tanuja Chitnis
  • Andrew Cagan
  • Vivian S Gainer
  • Pei J Chen
  • Katherine P Liao
  • Stanley Y Shaw
  • Ashwin N Ananthakrishnan
  • Peter Szolovits
  • Howard L Weiner
  • Elizabeth W Karlson
  • Shawn N Murphy
  • Guergana K Savova
  • Tianxi Cai
  • Susanne E Churchill
  • Robert M Plenge
  • Isaac S Kohane
  • Philip L De Jager

Abstract

Objective: To optimally leverage the scalability and unique features of the electronic health records (EHR) for research that would ultimately improve patient care, we need to accurately identify patients and extract clinically meaningful measures. Using multiple sclerosis (MS) as a proof of principle, we showcased how to leverage routinely collected EHR data to identify patients with a complex neurological disorder and derive an important surrogate measure of disease severity heretofore only available in research settings. Methods: In a cross-sectional observational study, 5,495 MS patients were identified from the EHR systems of two major referral hospitals using an algorithm that includes codified and narrative information extracted using natural language processing. In the subset of patients who receive neurological care at a MS Center where disease measures have been collected, we used routinely collected EHR data to extract two aggregate indicators of MS severity of clinical relevance multiple sclerosis severity score (MSSS) and brain parenchymal fraction (BPF, a measure of whole brain volume). Results: The EHR algorithm that identifies MS patients has an area under the curve of 0.958, 83% sensitivity, 92% positive predictive value, and 89% negative predictive value when a 95% specificity threshold is used. The correlation between EHR-derived and true MSSS has a mean R2 = 0.38±0.05, and that between EHR-derived and true BPF has a mean R2 = 0.22±0.08. To illustrate its clinical relevance, derived MSSS captures the expected difference in disease severity between relapsing-remitting and progressive MS patients after adjusting for sex, age of symptom onset and disease duration (p = 1.56×10−12). Conclusion: Incorporation of sophisticated codified and narrative EHR data accurately identifies MS patients and provides estimation of a well-accepted indicator of MS severity that is widely used in research settings but not part of the routine medical records. Similar approaches could be applied to other complex neurological disorders.

Suggested Citation

  • Zongqi Xia & Elizabeth Secor & Lori B Chibnik & Riley M Bove & Suchun Cheng & Tanuja Chitnis & Andrew Cagan & Vivian S Gainer & Pei J Chen & Katherine P Liao & Stanley Y Shaw & Ashwin N Ananthakrishna, 2013. "Modeling Disease Severity in Multiple Sclerosis Using Electronic Health Records," PLOS ONE, Public Library of Science, vol. 8(11), pages 1-9, November.
  • Handle: RePEc:plo:pone00:0078927
    DOI: 10.1371/journal.pone.0078927
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0078927
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0078927&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0078927?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jessica Gronsbell & Jessica Minnier & Sheng Yu & Katherine Liao & Tianxi Cai, 2019. "Automated feature selection of predictors in electronic medical records data," Biometrics, The International Biometric Society, vol. 75(1), pages 268-277, March.
    2. Katherine P Liao & Ashwin N Ananthakrishnan & Vishesh Kumar & Zongqi Xia & Andrew Cagan & Vivian S Gainer & Sergey Goryachev & Pei Chen & Guergana K Savova & Denis Agniel & Susanne Churchill & Jaeyoun, 2015. "Methods to Develop an Electronic Medical Record Phenotype Algorithm to Compare the Risk of Coronary Artery Disease across 3 Chronic Disease Cohorts," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-11, August.
    3. Jessica Gronsbell & Molei Liu & Lu Tian & Tianxi Cai, 2022. "Efficient evaluation of prediction rules in semi‐supervised settings under stratified sampling," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(4), pages 1353-1391, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0078927. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.