XuanHuGPT: parameter-efficient fine-tuning of large language model in the field of traditional Chinese medicine
Xuming Tong, Xiaozheng Ding, Huiru Jia, Yanhong Yuan, Liyan Liu, Yapeng Wang, Zhang Xiong, Xu Yang, Sio Kei Im, Mini Han Wang

TL;DR
XuanHuGPT is a specialized AI model for Traditional Chinese Medicine, trained with a new dataset and efficient tuning methods to improve accuracy and performance.
Contribution
A novel parameter-efficient fine-tuning approach for TCM-specific LLMs using a structured dataset and comprehensive evaluation framework.
Findings
XuanHuGPT outperforms general and TCM-specific models in accuracy, coverage, and fluency.
The XhTCM dataset with 100,000 entries enhances model training for TCM tasks.
PEFT techniques effectively balance performance and training costs for domain-specific models.
Abstract
Large Language Models (LLMs) have demonstrated exceptional generalization capabilities across various fields, including their application in Traditional Chinese Medicine (TCM). However, the performance of existing LLMs in TCM-specific tasks remains limited due to the lack of optimization for TCM knowledge during the pre-training phase, insufficient datasets, and the constraints of fine-tuning techniques. To address these challenges, this study constructs the XhTCM dataset by systematically integrating data from three authoritative sources—ShenNong_TCM_Dataset, TCMBank, and TCMIP v2.0. The dataset includes 100,000 structured entries, covering classical theories, prescription formulations, herbal pharmacology, and modern clinical practices. Based on this, we present XuanHuGPT, a domain-specific LLM tailored for TCM question answering and inference. By applying Parameter-Efficient…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies · Machine Learning in Healthcare · Computational Drug Discovery Methods
