Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i9p5601-d808682.html

Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine

Author

Listed:
  • Chenyuan Hu

    (School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China)

  • Shuoyan Zhang

    (School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China)

  • Tianyu Gu

    (School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China)

  • Zhuangzhi Yan

    (Institute of Biomedical Engineering, School of Life Science, Shanghai University, Shanghai 200444, China)

  • Jiehui Jiang

    (Institute of Biomedical Engineering, School of Life Science, Shanghai University, Shanghai 200444, China)

Abstract

Evidence-based treatment is the basis of traditional Chinese medicine (TCM), and the accurate differentiation of syndromes is important for treatment in this context. The automatic differentiation of syndromes from unstructured medical records requires two important steps: Chinese word segmentation and text classification. Because of the ambiguity of the Chinese language and the peculiarities of syndrome differentiation, both tasks are challenging. We model syndrome differentiation for TCM as text classification, and use multi-task learning (MTL) and deep learning to carry out Chinese word segmentation and syndrome differentiation. Two classic deep neural networks, bidirectional long short-term memory (Bi-LSTM) and text-based convolutional neural networks (TextCNN), are fused into MTL to carry out these two tasks simultaneously. In a large number of comparative experiments, the proposed method was superior to prevalent methods on both tasks. Our model yielded accuracy, specificity, and sensitivity of 0.93, 0.94, and 0.90 on the Chinese word segmentation task, and of 0.80, 0.82, and 0.78 on the syndrome differentiation task. Moreover, statistical analyses showed that the accuracies of the non-joint and joint models were both within the 95% confidence interval, with p-value < 0.05. The work here can help modernize TCM through intelligent differentiation.
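
The abstract reports accuracy, specificity, and sensitivity for each task. As a minimal sketch of how these three figures relate to a confusion matrix, the Python below computes them for binary labels. The function name `binary_metrics` and the toy labels are hypothetical, and the abstract does not say how the authors aggregated the metrics over multiple segmentation tags or syndrome classes, so this should not be read as their evaluation code.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Return accuracy, sensitivity and specificity for binary labels (1 = positive)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    total = tp + tn + fp + fn
    return {
        # Share of all predictions that are correct.
        "accuracy": (tp + tn) / total if total else 0.0,
        # True positive rate: share of actual positives that were found.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # True negative rate: share of actual negatives that were rejected.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


if __name__ == "__main__":
    # Toy data (illustrative only): 4 positives, 6 negatives.
    y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
    y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
    print(binary_metrics(y_true, y_pred))
```

For a multi-class task, one common option is to compute these values per class in a one-vs-rest fashion and then average them; whether the paper does this is not stated in the abstract.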

Suggested Citation

  • Chenyuan Hu & Shuoyan Zhang & Tianyu Gu & Zhuangzhi Yan & Jiehui Jiang, 2022. "Multi-Task Joint Learning Model for Chinese Word Segmentation and Syndrome Differentiation in Traditional Chinese Medicine," IJERPH, MDPI, vol. 19(9), pages 1-13, May.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:9:p:5601-:d:808682

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/9/5601/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/9/5601/
    Download Restriction: no


    Citations

    Cited by:

    1. Senqi Yang & Xuliang Duan & Zeyan Xiao & Zhiyao Li & Yuhai Liu & Zhihao Jie & Dezhao Tang & Hui Du, 2022. "Sentiment Classification of Chinese Tourism Reviews Based on ERNIE-Gram+GCN," IJERPH, MDPI, vol. 19(20), pages 1-20, October.
    2. Shangyi Yan & Jingya Wang & Zhiqiang Song, 2022. "Microblog Sentiment Analysis Based on Dynamic Character-Level and Word-Level Features and Multi-Head Self-Attention Pooling," Future Internet, MDPI, vol. 14(8), pages 1-19, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Roman Tikhonov & Aleksey Masyutin & Vadim Anpilogov, 2021. "The Relationship Between the Financial Performance of Banks and the Quality of Credit Scoring Models," Russian Journal of Money and Finance, Bank of Russia, vol. 80(2), pages 76-95, June.
    2. Demuynck, Thomas & Salman, Umutcan, 2022. "On the revealed preference analysis of stable aggregate matchings," Theoretical Economics, Econometric Society, vol. 17(4), November.
    3. Antonio D’Ambrosio & Sonia Amodio & Carmela Iorio & Giuseppe Pandolfo & Roberta Siciliano, 2021. "Adjusted Concordance Index: an Extension of the Adjusted Rand Index to Fuzzy Partitions," Journal of Classification, Springer;The Classification Society, vol. 38(1), pages 112-128, April.
    4. Dong, Lei & Tao, Junyu & Zhang, Zhaoling & Yan, Beibei & Cheng, Zhanjun & Chen, Guanyi, 2021. "Energy utilization and disposal of herb residue by an integrated energy conversion system: A pilot scale study," Energy, Elsevier, vol. 215(PB).
    5. Laurens Cherchye & Thomas Demuynck & Bram De Rock & Joshua Lanier, 2020. "Are Consumers Rational? Shifting the Burden of Proof," Working Papers ECARES 2020-19, ULB -- Universite Libre de Bruxelles.
    6. Virginie Rozée & Sayeed Unisa & Elise de La Rochebrochard, 2019. "Sociodemographic characteristics of 96 Indian surrogates: Are they disadvantaged compared with the general population?," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-9, March.
