Printed from https://ideas.repec.org/a/gam/jijerp/v16y2019i22p4360-d284870.html

Comparison of Word Embeddings for Extraction from Medical Records

Author

Listed:
  • Aleksei Dudchenko

    (National Center for Cognitive Technologies, ITMO University, 197101 Saint-Petersburg, Russia
    Institute of Medical Biometry and Informatics, Heidelberg University, 69120 Heidelberg, Germany)

  • Georgy Kopanitsa

    (National Center for Cognitive Technologies, ITMO University, 197101 Saint-Petersburg, Russia)

Abstract

This paper extends work originally presented at the 16th International Conference on Wearable, Micro and Nano Technologies for Personalized Health. Despite the adoption of electronic medical records, free narrative text is still widely used in them. To make data from these texts available to decision support systems, supervised machine learning algorithms may be successfully applied. In this work, we developed prototypes of a medical data extraction system based on different artificial neural network architectures for processing free medical texts in Russian, and compared them. Three classifiers were applied to extract entities from snippets of text. Multi-layer perceptron (MLP) and convolutional neural network (CNN) classifiers showed similar results with all three embedding models. MLP outperformed the CNN on pipelines that used the embedding model trained on medical records with preliminary lemmatization. Nevertheless, the highest F-score was achieved by the CNN, which slightly outperformed MLP when the largest word2vec model was applied (F-score 0.9763).

Suggested Citation

  • Aleksei Dudchenko & Georgy Kopanitsa, 2019. "Comparison of Word Embeddings for Extraction from Medical Records," IJERPH, MDPI, vol. 16(22), pages 1-8, November.
  • Handle: RePEc:gam:jijerp:v:16:y:2019:i:22:p:4360-:d:284870

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/16/22/4360/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/16/22/4360/
    Download Restriction: no