Reliable Multi-Label Learning via Conformal Predictor and Random Forest for Syndrome Differentiation of Chronic Fatigue in Traditional Chinese Medicine

Authors

Huazhen Wang
Xin Liu
Bing Lv
Fan Yang
Yanzhu Hong

Abstract

Objective: The etiology, pathophysiology, nomenclature and diagnostic criteria of Chronic Fatigue (CF) remain unclear in the medical community. Traditional Chinese medicine (TCM) adopts a unique diagnostic method, namely ‘bian zheng lun zhi’ or syndrome differentiation, to diagnose CF with a set of syndrome factors, which can be regarded as a Multi-Label Learning (MLL) problem in the machine learning literature. To obtain an effective and reliable diagnostic tool, we use Conformal Predictor (CP), Random Forest (RF) and the Problem Transformation (PT) method for the syndrome differentiation of CF.

Methods and Materials: Using the PT method, CP-RF is extended to handle the MLL problem. CP-RF applies RF to measure the confidence level (p-value) of each label being the true label, and then selects the labels whose p-values are larger than the pre-defined significance level as the region prediction. We compare the proposed CP-RF with CP-NBC (Naïve Bayes Classifier), CP-KNN (K-Nearest Neighbors) and ML-KNN on a CF dataset of 736 cases. Specifically, 95 symptoms are used to identify CF, and four syndrome factors are employed in the syndrome differentiation: ‘spleen deficiency’, ‘heart deficiency’, ‘liver stagnation’ and ‘qi deficiency’.

Results: CP-RF outperforms CP-NBC, CP-KNN and ML-KNN under the general metrics of subset accuracy, hamming loss, one-error, coverage, ranking loss and average precision. Furthermore, the performance of CP-RF remains steady across a wide range of confidence levels, from 80% to 100%, which indicates its robustness to the choice of threshold. In addition, the confidence evaluation provided by CP is valid and well-calibrated.

Conclusion: CP-RF not only offers outstanding performance but also provides valid confidence evaluation for CF syndrome differentiation. It would be well suited to TCM practitioners and would facilitate the use of objective, effective and reliable computer-based diagnostic tools.
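The selection rule described in the abstract (compute a p-value per label, keep the labels whose p-value exceeds the significance level) can be illustrated with a minimal sketch. This is not the authors' implementation: the nonconformity scores below are made-up numbers standing in for whatever an RF-based nonconformity measure would produce, the p-value formula is the standard conformal one, and the names `conformal_p_value`, `region_prediction`, `calibration_scores` and `test_scores` are hypothetical.

```python
from typing import Dict, List


def conformal_p_value(calibration_scores: List[float], test_score: float) -> float:
    """Standard conformal p-value: the share of examples (calibration
    examples plus the test example itself) that are at least as
    nonconforming as the test example."""
    at_least_as_strange = sum(1 for s in calibration_scores if s >= test_score)
    return (at_least_as_strange + 1) / (len(calibration_scores) + 1)


def region_prediction(p_values: Dict[str, float], significance: float) -> List[str]:
    """Keep every label whose p-value is larger than the significance level."""
    return [label for label, p in p_values.items() if p > significance]


# Hypothetical nonconformity scores (higher means stranger). In a CP-RF
# setting these would be derived from a random forest; here they are
# invented for illustration, and one shared list is assumed for all labels.
calibration_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]

# Hypothetical nonconformity of one new case under each candidate syndrome factor.
test_scores = {
    "spleen deficiency": 0.25,
    "heart deficiency": 0.95,
    "liver stagnation": 0.65,
    "qi deficiency": 0.05,
}

p_values = {
    label: conformal_p_value(calibration_scores, score)
    for label, score in test_scores.items()
}

# A significance level of 0.2 corresponds to a confidence level of 80%.
region = region_prediction(p_values, significance=0.2)

for label, p in p_values.items():
    print(f"{label}: p-value = {p:.2f}")
print("Region prediction at 80% confidence:", region)
```

Lowering the significance level (that is, raising the confidence level) can only enlarge the predicted label set, which is why a region prediction at 95% confidence is never smaller than one at 80%.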

Suggested Citation

Huazhen Wang & Xin Liu & Bing Lv & Fan Yang & Yanzhu Hong, 2014. "Reliable Multi-Label Learning via Conformal Predictor and Random Forest for Syndrome Differentiation of Chronic Fatigue in Traditional Chinese Medicine," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-14, June.

Handle: RePEc:plo:pone00:0099565
DOI: 10.1371/journal.pone.0099565
