ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Junying Chen, Zhenyang Cai, Zhiheng Liu, Yunjin Yang, Rongsheng Wang, Qingying Xiao, Xiangyi Feng, Zhan Su, Jing Guo, Xiang Wan, Guangjun Yu, Haizhou Li, Benyou Wang

TL;DR
ShizhenGPT is a pioneering multimodal large language model designed specifically for Traditional Chinese Medicine, integrating diverse sensory data to enhance diagnostic reasoning and understanding.
Contribution
It introduces the first multimodal LLM for TCM, curates the largest TCM dataset, and demonstrates superior multimodal reasoning and visual understanding capabilities.
Findings
Outperforms comparable-scale LLMs in TCM tasks
Leads in visual understanding among multimodal LLMs
Achieves unified perception across multiple sensory modalities
Abstract
Despite the success of large language models (LLMs) in various domains, their potential in Traditional Chinese Medicine (TCM) remains largely underexplored due to two critical barriers: (1) the scarcity of high-quality TCM data and (2) the inherently multimodal nature of TCM diagnostics, which involve looking, listening, smelling, and pulse-taking. These sensory-rich modalities are beyond the scope of conventional LLMs. To address these challenges, we present ShizhenGPT, the first multimodal LLM tailored for TCM. To overcome data scarcity, we curate the largest TCM dataset to date, comprising 100GB+ of text and 200GB+ of multimodal data, including 1.2M images, 200 hours of audio, and physiological signals. ShizhenGPT is pretrained and instruction-tuned to achieve deep TCM knowledge and multimodal reasoning. For evaluation, we collect recent national TCM qualification exams and build a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗FreedomIntelligence/ShizhenGPT-7B-LLMmodel· 300 dl· ♡ 3300 dl♡ 3
- 🤗FreedomIntelligence/ShizhenGPT-7B-Omnimodel· 80 dl· ♡ 580 dl♡ 5
- 🤗FreedomIntelligence/ShizhenGPT-7B-VLmodel· 257 dl· ♡ 2257 dl♡ 2
- 🤗FreedomIntelligence/ShizhenGPT-32B-LLMmodel· 8 dl· ♡ 28 dl♡ 2
- 🤗FreedomIntelligence/ShizhenGPT-32B-VLmodel· 126 dl· ♡ 6126 dl♡ 6
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies
