IDEAS home Printed from https://ideas.repec.org/a/plo/pdig00/0001230.html

Portability of a text mining algorithm for detecting adverse drug reactions in electronic health records across diverse patient groups in two Dutch hospitals

Author

Listed:
  • Britt W M van de Burgt
  • Loes F C van Dijck
  • Bjorn Dullemond
  • Naomi T Jessurun
  • Minou van Seyen
  • Rob J van Marum
  • Remco J A van Wensen
  • Wai-Yan Liu
  • Carolien M J van der Linden
  • Rene J E Grouls
  • R Arthur Bouwman
  • Erik H M Korsten
  • Toine C G Egberts

Abstract

Adverse Drug Reactions (ADRs) pose a significant challenge in healthcare. While structured documentation of ADRs in electronic health records (EHRs) enables automated alerting, many ADRs are recorded as unstructured free-text, limiting detection. Text mining (TM) shows potential for extracting clinically relevant data from unstructured text. However, the portability of TM algorithms across different institutions and departments remains uncertain, due to variations in EHR structures and documentation practices. To enhance these general-purpose algorithms, evaluating their portability is essential for ensuring effective performance across diverse clinical settings. To evaluate the portability of a previously developed TM-based ADR identification algorithm by assessing its performance using EHRs from two different departments in two different hospitals. EHR free-text data from 62 hospitalized patients in the geriatric and orthopedic departments of two Dutch teaching hospitals were reviewed for ADRs via manual review and the TM algorithm. Performance was evaluated using F-score, sensitivity and positive predictive value (PPV), with comparisons across hospitals and departments. Manual review identified 359 unique ADRs. The TM algorithm detected 534 potential ADRs (pADRs), 286 of which overlapped with manual review, yielding an F-score of 0.64, sensitivity of 80% and PPV of 54%. Performance was consistent across hospitals and departments. Notably, 26 pADRs identified by the algorithm were clinically relevant yet missed in manual review. This study demonstrates portability of the TM algorithm by identifying pADRs across different hospitals and departments without adaptations. These findings support its broader implementation potential for ADR detection in diverse healthcare settings.Author summary: Adverse Drug Reactions (ADRs) present a significant challenge in healthcare, with many being recorded as unstructured free-text in Electronic Health Records (EHRs). This study evaluates the portability of a text mining (TM) algorithm developed for ADR identification in EHRs, by assessing its performance across two departments in two Dutch hospitals. EHR free-text data from 62 patients in the geriatric and orthopedic departments were analyzed using manual review and the TM algorithm. The results showed that the TM algorithm demonstrated a good performance, with an F-score of 0.64, sensitivity of 80%, and positive predictive value (PPV) of 54%. Additionally, the algorithm identified clinically relevant ADRs that were missed in manual review. These findings suggest that the TM algorithm is portable across different clinical settings without requiring adaptation, highlighting its potential for broader implementation in ADR detection.

Suggested Citation

  • Britt W M van de Burgt & Loes F C van Dijck & Bjorn Dullemond & Naomi T Jessurun & Minou van Seyen & Rob J van Marum & Remco J A van Wensen & Wai-Yan Liu & Carolien M J van der Linden & Rene J E Groul, 2026. "Portability of a text mining algorithm for detecting adverse drug reactions in electronic health records across diverse patient groups in two Dutch hospitals," PLOS Digital Health, Public Library of Science, vol. 5(2), pages 1-14, February.
  • Handle: RePEc:plo:pdig00:0001230
    DOI: 10.1371/journal.pdig.0001230
    as

    Download full text from publisher

    File URL: https://journals.plos.org/digitalhealth/article?id=10.1371/journal.pdig.0001230
    Download Restriction: no

    File URL: https://journals.plos.org/digitalhealth/article/file?id=10.1371/journal.pdig.0001230&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pdig.0001230?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pdig00:0001230. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: digitalhealth (email available below). General contact details of provider: https://journals.plos.org/digitalhealth .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.