IDEAS home Printed from https://ideas.repec.org/a/axf/gbppsa/v17y2025ip241-248.html

Research on Named Entity Recognition of Chinese Electronic Medical Records Based on the BERT-BiLSTM-CRF Model

Author

Listed:
  • Ren, Qian
  • Zhang, Weiwei

Abstract

To address the problems of blurred entity boundaries, nested structures, and semantic ambiguity in Chinese electronic medical records (EMRs), this paper proposes a named entity recognition (NER) method based on the BERT-BiLSTM-CRF model. The model utilizes the BERT pre-trained language model to capture contextual semantic information, employs a Bidirectional Long Short-Term Memory (BiLSTM) network to further extract sequence features, and integrates a Conditional Random Field (CRF) layer for global label optimization, thereby improving the accuracy of medical entity recognition. The experimental data are derived from real clinical cases in the Anorectal Department of Xi'an Hospital of Traditional Chinese Medicine, comprising a total of 5,947 text records. After regular expression cleaning, word segmentation, and manual annotation, the dataset was divided into training, validation, and test sets in an 8:1:1 ratio. Experimental results show that the proposed model achieves superior performance in recognizing multiple categories of medical entities-such as diseases, symptoms, syndromes, and prescriptions-with an overall precision of 92.37%, recall of 94.25%, and F1-score of 93.29%. Compared with CNN-CRF and traditional BiLSTM-CRF models, the proposed approach demonstrates higher accuracy and robustness in Chinese EMR entity recognition.

Suggested Citation

  • Ren, Qian & Zhang, Weiwei, 2025. "Research on Named Entity Recognition of Chinese Electronic Medical Records Based on the BERT-BiLSTM-CRF Model," GBP Proceedings Series, Scientific Open Access Publishing, vol. 17, pages 241-248.
  • Handle: RePEc:axf:gbppsa:v:17:y:2025:i::p:241-248
    as

    Download full text from publisher

    File URL: https://soapubs.com/index.php/GBPPS/article/view/1041/1023
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:axf:gbppsa:v:17:y:2025:i::p:241-248. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Yuchi Liu (email available below). General contact details of provider: https://soapubs.com/index.php/GBPPS .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.