IDEAS home Printed from https://ideas.repec.org/a/spr/drugsa/v45y2022i8d10.1007_s40264-022-01196-x.html
   My bibliography  Save this article

Combining Machine Learning with a Rule-Based Algorithm to Detect and Identify Related Entities of Documented Adverse Drug Reactions on Hospital Discharge Summaries

Author

Listed:
  • Hui Xing Tan

    (Health Sciences Authority)

  • Chun Hwee Desmond Teo

    (Health Sciences Authority)

  • Pei San Ang

    (Health Sciences Authority)

  • Wei Ping Celine Loke

    (Health Sciences Authority)

  • Mun Yee Tham

    (Health Sciences Authority)

  • Siew Har Tan

    (Health Sciences Authority)

  • Bee Leng Sally Soh

    (Health Sciences Authority)

  • Pei Qin Belinda Foo

    (Health Sciences Authority)

  • Zheng Jye Ling

    (Regional Health System Office, National University of Singapore, National University Health System)

  • Wei Luen James Yip

    (National University Heart Centre
    National University Health System)

  • Yixuan Tang

    (National University of Singapore)

  • Jisong Yang

    (National University of Singapore)

  • Kum Hoe Anthony Tung

    (National University of Singapore)

  • Sreemanee Raaj Dorajoo

    (Health Sciences Authority)

Abstract

Introduction Discharge summaries contain valuable information about adverse drug reactions, but their unstructured nature makes them challenging to analyse and use as a signal source for pharmacovigilance. Machine learning has shown promise in identifying discharge summaries that contain related drug-adverse event pairs but has fared relatively poorer in entity extraction. Methods A hybrid model is developed combining rule-based and machine learning algorithms using discharge summaries with the aim of maximising capture of related drug-adverse event pairs. The rule first identifies segments containing adverse event entities within a 100-character distance from a drug term; machine learning subsequently estimates the relatedness of the drug and adverse event entities contained. The approach is validated on four independent datasets that are temporally and geographically separated from model development data. The impact of restricted drug-adverse event pair detection on recall is evaluated by using two of the four validation datasets that do not impose rule-based restrictions to annotations. Results The hybrid model achieves a recall of 0.80 (fivefold cross validation), 0.80 (temporal) and 0.76 (geographical) on validation using datasets containing only pre-identified target text segments that fulfil the rule-based algorithm criteria. When tested on datasets that additionally contained drug-adverse event pairs not restricted by the rule-based criteria, recall of the model declines to 0.68 and 0.62 on temporally and geographically separated datasets, respectively. Conclusions The proposed hybrid model demonstrates reasonable generalisability on external validation. Rule-based restriction of the detection space results in an approximately 12–14% reduction in recall but improves identification of the related drug and adverse event terms.

Suggested Citation

  • Hui Xing Tan & Chun Hwee Desmond Teo & Pei San Ang & Wei Ping Celine Loke & Mun Yee Tham & Siew Har Tan & Bee Leng Sally Soh & Pei Qin Belinda Foo & Zheng Jye Ling & Wei Luen James Yip & Yixuan Tang &, 2022. "Combining Machine Learning with a Rule-Based Algorithm to Detect and Identify Related Entities of Documented Adverse Drug Reactions on Hospital Discharge Summaries," Drug Safety, Springer, vol. 45(8), pages 853-862, August.
  • Handle: RePEc:spr:drugsa:v:45:y:2022:i:8:d:10.1007_s40264-022-01196-x
    DOI: 10.1007/s40264-022-01196-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40264-022-01196-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40264-022-01196-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Abhyuday Jagannatha & Feifan Liu & Weisong Liu & Hong Yu, 2019. "Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0)," Drug Safety, Springer, vol. 42(1), pages 99-111, January.
    2. Xi Yang & Jiang Bian & Yan Gong & William R. Hogan & Yonghui Wu, 2019. "MADEx: A System for Detecting Medications, Adverse Drug Events, and Their Relations from Clinical Notes," Drug Safety, Springer, vol. 42(1), pages 123-133, January.
    3. Alec B. Chapman & Kelly S. Peterson & Patrick R. Alba & Scott L. DuVall & Olga V. Patterson, 2019. "Detecting Adverse Drug Events with Rapidly Trained Classification Models," Drug Safety, Springer, vol. 42(1), pages 147-156, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Feifan Liu & Abhyuday Jagannatha & Hong Yu, 2019. "Towards Drug Safety Surveillance and Pharmacovigilance: Current Progress in Detecting Medication and Adverse Drug Events from Electronic Health Records," Drug Safety, Springer, vol. 42(1), pages 95-97, January.
    2. Likeng Liang & Jifa Hu & Gang Sun & Na Hong & Ge Wu & Yuejun He & Yong Li & Tianyong Hao & Li Liu & Mengchun Gong, 2022. "Artificial Intelligence-Based Pharmacovigilance in the Setting of Limited Resources," Drug Safety, Springer, vol. 45(5), pages 511-519, May.
    3. Abhyuday Jagannatha & Feifan Liu & Weisong Liu & Hong Yu, 2019. "Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0)," Drug Safety, Springer, vol. 42(1), pages 99-111, January.
    4. Siun Kim & Taegwan Kang & Tae Kyu Chung & Yoona Choi & YeSol Hong & Kyomin Jung & Howard Lee, 2023. "Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques," Drug Safety, Springer, vol. 46(8), pages 781-795, August.
    5. Heba Edrees & Wenyu Song & Ania Syrowatka & Aurélien Simona & Mary G. Amato & David W. Bates, 2022. "Intelligent Telehealth in Pharmacovigilance: A Future Perspective," Drug Safety, Springer, vol. 45(5), pages 449-458, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:drugsa:v:45:y:2022:i:8:d:10.1007_s40264-022-01196-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com/economics/journal/40264 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.