BianCang: A Traditional Chinese Medicine Large Language Model
Sibo Wei, Xueping Peng, Yi-Fei Wang, Tao Shen, Jiasheng Si, Weiyu Zhang, Fa Zhu, Athanasios V. Vasilakos, Wenpeng Lu, Xiaoming Wu, Yinglong Wang

TL;DR
BianCang is a specialized large language model for traditional Chinese medicine, developed through a two-stage training process involving domain-specific knowledge injection and alignment, improving TCM diagnosis and syndrome differentiation.
Contribution
The paper introduces BianCang, a novel TCM-specific LLM trained with a comprehensive dataset and a two-stage process, addressing the gap in medical LLMs for traditional Chinese medicine.
Findings
BianCang outperforms 31 models across 11 test sets.
The model effectively enhances TCM diagnosis and syndrome differentiation.
Extensive datasets improve the understanding of TCM in LLMs.
Abstract
The surge of large language models (LLMs) has driven significant progress in medical applications, including traditional Chinese medicine (TCM). However, current medical LLMs struggle with TCM diagnosis and syndrome differentiation due to substantial differences between TCM and modern medical theory, and the scarcity of specialized, high-quality corpora. To this end, in this paper we propose BianCang, a TCM-specific LLM, using a two-stage training process that first injects domain-specific knowledge and then aligns it through targeted stimulation to enhance diagnostic and differentiation capabilities. Specifically, we constructed pre-training corpora, instruction-aligned datasets based on real hospital records, and the ChP-TCM dataset derived from the Pharmacopoeia of the People's Republic of China. We compiled extensive TCM and medical corpora for continual pre-training and supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗QLU-NLP/BianCang-Qwen2-7Bmodel· 10 dl· ♡ 310 dl♡ 3
- 🤗QLU-NLP/BianCang-Qwen2-7B-Instructmodel· 3 dl3 dl
- 🤗QLU-NLP/BianCang-Qwen2.5-7Bmodel· 3 dl· ♡ 23 dl♡ 2
- 🤗QLU-NLP/BianCang-Qwen2.5-7B-Instructmodel· 341 dl· ♡ 5341 dl♡ 5
- 🤗QLU-NLP/BianCang-Qwen2.5-14Bmodel· 4 dl· ♡ 14 dl♡ 1
- 🤗QLU-NLP/BianCang-Qwen2.5-14B-Instructmodel· 14 dl· ♡ 114 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies
