Research on the Construction of TCM Diagnosis Model based on Large Language Model

Mu Li; Yulin Xia; Wei Hu; Ziyi Zhang; Chengchen Zhang

doi:10.54097/fpbnvg67

Authors

Mu Li
Yulin Xia
Wei Hu
Ziyi Zhang
Chengchen Zhang

DOI:

https://doi.org/10.54097/fpbnvg67

Keywords:

TCM Syndrome Differentiation, Large Language Model, ChatGLM3, LoRA Fine Tuning, Data Annotation

Abstract

To solve many problems in TCM syndrome differentiation, including the lack of public data and quality differences, the problem of singleness and universality of models, and the lack of interpretability of models, a solution based on large language model ChatGLM3 combined with LoRA fine-tuning technology was proposed. The open source TCM-SD TCM syndrome differentiation data set was adopted, and after data filtering and integration optimization, 1027 TCM syndrome differentiation definition data sets, 41180 consultation training data and 5485 testing and verification data were obtained, so that the model could deeply learn the specialized knowledge of TCM syndromes and the actual consultation records of TCM syndrome differentiation. The experimental results show that the evaluation indexes of two different trainings using LoRA fine-tuning technology are significantly improved by about 20%.

Downloads

Download data is not yet available.

References

[1] General Office of the State Council of the People’s Republic of China. The 14th Five Year Plan for the development of traditional Chinese medicine [EB/OL]. (2022-03-03)[2024-02-03]. https://www.gov. cn/gongbao/ content/ 2022/ content_ 568 6029.htm.

[2] Wang Ye. Research on Intelligent Chinese Medicine Discriminatory Method Based on Deep Learning [D]. Henan University of Science and Technology,2022.000120.

[3] SONG Yijie, MA Suya, DAI Yasheng, et al. Key issues and technical challenges of artificial intelligence-assisted Chinese medicine discernment[J]. China Engineering Science,2024,26 (02): 234-244.

[4] YANG Lele, WANG Zhe, YAO Keyu, LIU Lihong, ZHU Yan. Prospective thoughts on the application of big language modelling in the field of traditional Chinese medicine[J/OL]. Chinese Journal of Traditional Chinese Medicine, 1-20.

[5] Wang R, Pan C, Chen J, et al. Construction of a knowledge framework system for intelligent diagnosis in Chinese medicine [J]. Journal of Traditional Chinese Medicine, 2024, 65(4): 341-346.

[6] LIU Yuehan, HUO Haobin, JIN Changuo. Practice and exploration of building enterprise-level private big language model assistant based on ChatGLM3 and RPA technology[J]. Architectural Design Management,2023,40(12):33-40.

[7] Hu E J , Shen Y , Wallis P ,et al. LoRA: Low-Rank Adaptation of Large Language Models[J]. 2021.DOI: 10. 48550/ arXiv. 2106.09685.pdf.

[8] PAPINENI K, ROUKOS S, WARD T, et al. Bleu: a method for automatic evaluation of machine translation [C]// Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 2002:311-318.

[9] LIN C Y. Rouge: A package for automatic evaluation of summaries[C]//Text summarisation branches out. 2004:74-81.

[10] Zhang Jundong, Yang Songhuah,Liu Jiangfeng,et al.AIGC empowers the revitalisation of ancient Chinese medicine:the construction of the Huang-Di grand model[J].Library Forum, 2024, 44(10):103-112.

Research on the Construction of TCM Diagnosis Model based on Large Language Model

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

Issue

Section

License

How to Cite

Cover

CNKI Indexing

Keywords

Latest publications