IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0226718.html
   My bibliography  Save this article

Development and validation of a prognostic model predicting symptomatic hemorrhagic transformation in acute ischemic stroke at scale in the OHDSI network

Author

Listed:
  • Qiong Wang
  • Jenna M Reps
  • Kristin Feeney Kostka
  • Patrick B Ryan
  • Yuhui Zou
  • Erica A Voss
  • Peter R Rijnbeek
  • RuiJun Chen
  • Gowtham A Rao
  • Henry Morgan Stewart
  • Andrew E Williams
  • Ross D Williams
  • Mui Van Zandt
  • Thomas Falconer
  • Margarita Fernandez-Chas
  • Rohit Vashisht
  • Stephen R Pfohl
  • Nigam H Shah
  • Suranga N Kasthurirathne
  • Seng Chan You
  • Qing Jiang
  • Christian Reich
  • Yi Zhou

Abstract

Background and purpose: Hemorrhagic transformation (HT) after cerebral infarction is a complex and multifactorial phenomenon in the acute stage of ischemic stroke, and often results in a poor prognosis. Thus, identifying risk factors and making an early prediction of HT in acute cerebral infarction contributes not only to the selections of therapeutic regimen but also, more importantly, to the improvement of prognosis of acute cerebral infarction. The purpose of this study was to develop and validate a model to predict a patient’s risk of HT within 30 days of initial ischemic stroke. Methods: We utilized a retrospective multicenter observational cohort study design to develop a Lasso Logistic Regression prediction model with a large, US Electronic Health Record dataset which structured to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). To examine clinical transportability, the model was externally validated across 10 additional real-world healthcare datasets include EHR records for patients from America, Europe and Asia. Results: In the database the model was developed, the target population cohort contained 621,178 patients with ischemic stroke, of which 5,624 patients had HT within 30 days following initial ischemic stroke. 612 risk predictors, including the distance a patient travels in an ambulance to get to care for a HT, were identified. An area under the receiver operating characteristic curve (AUC) of 0.75 was achieved in the internal validation of the risk model. External validation was performed across 10 databases totaling 5,515,508 patients with ischemic stroke, of which 86,401 patients had HT within 30 days following initial ischemic stroke. The mean external AUC was 0.71 and ranged between 0.60–0.78. Conclusions: A HT prognostic predict model was developed with Lasso Logistic Regression based on routinely collected EMR data. This model can identify patients who have a higher risk of HT than the population average with an AUC of 0.78. It shows the OMOP CDM is an appropriate data standard for EMR secondary use in clinical multicenter research for prognostic prediction model development and validation. In the future, combining this model with clinical information systems will assist clinicians to make the right therapy decision for patients with acute ischemic stroke.

Suggested Citation

  • Qiong Wang & Jenna M Reps & Kristin Feeney Kostka & Patrick B Ryan & Yuhui Zou & Erica A Voss & Peter R Rijnbeek & RuiJun Chen & Gowtham A Rao & Henry Morgan Stewart & Andrew E Williams & Ross D Willi, 2020. "Development and validation of a prognostic model predicting symptomatic hemorrhagic transformation in acute ischemic stroke at scale in the OHDSI network," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-12, January.
  • Handle: RePEc:plo:pone00:0226718
    DOI: 10.1371/journal.pone.0226718
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0226718
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0226718&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0226718?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jenna Wong & Daniel Prieto-Alhambra & Peter R. Rijnbeek & Rishi J. Desai & Jenna M. Reps & Sengwee Toh, 2022. "Applying Machine Learning in Distributed Data Networks for Pharmacoepidemiologic and Pharmacovigilance Studies: Opportunities, Challenges, and Considerations," Drug Safety, Springer, vol. 45(5), pages 493-510, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0226718. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.