# Digitally Assisted Clinical Decision-Making in Traditional Chinese Medicine: Comparative Study of 5 Large Language Models

**Authors:** Weiwei Liu, Shuchang Miao, Qun Ma, Yuxin Li, Yinxiang Deng, Xiaoqiu Wang, Xinwei Zhang, Nuosha Ma, Hanchi Miao, Yang Si, Qingxia Shi, Lin Zhu, Hongtao Shang, Yue Wang

PMC · DOI: 10.2196/80167 · JMIR Formative Research · 2026-03-02

## TL;DR

This study compares five AI models in Traditional Chinese Medicine decision-making, finding that AI assistance improves both quality and speed of clinical decisions when used with human practitioners.

## Contribution

The study introduces the first evaluation of large language models in TCM clinical decision-making and demonstrates effective human-AI collaboration in this domain.

## Key findings

- DeepSeek-R1 outperformed other models with 96.7% knowledge accuracy and high clinical case analysis scores.
- Human-AI collaboration improved decision quality by 16.1% and reduced time by 66.1% compared to physician-only decisions.
- AI assistance was most beneficial for prescription formulation and medication selection.

## Abstract

Traditional Chinese medicine (TCM) clinical decision-making involves complex integration of syndrome differentiation, constitutional assessment, and individualized treatment selection, creating challenges for standardization and quality assurance. While large language models (LLMs) demonstrate capabilities in medical knowledge integration and clinical reasoning, their application to TCM remains largely unexplored, particularly regarding syndrome differentiation principles and prescription formulation.

This study evaluated 5 contemporary LLMs in TCM clinical decision-making and assessed human–artificial intelligence (AI) collaboration compared with independent approaches. Specific objectives were to benchmark LLM performance in TCM knowledge assessment, evaluate clinical case analysis capabilities, identify the optimal model, and assess the quality, efficiency, and acceptability of human-AI collaboration.

In total, 5 mainstream LLMs were evaluated—Claude 3.7 Sonnet-Extended (Anthropic), ChatGPT 4.5 (OpenAI), Grok3-DeepSearch (xAI), Gemini 2.0 Flash Thinking Experimental (Google), and DeepSeek-R1 (DeepSeek). The evaluation consisted of four phases, (1) TCM knowledge assessment using 160 standardized questions, (2) clinical case analysis of 30 cases representing different disease systems and complexity levels, (3) optimal model selection using weighted scoring (40% knowledge and 60% clinical analysis), and (4) clinical application assessment involving 10 TCM practitioners and 2 experts comparing physician-only, AI-only, and human-AI collaboration across 5 clinical cases. Statistical analysis included descriptive statistics, reliability analysis, comparative testing, and regression analysis.

DeepSeek-R1 demonstrated superior performance across both evaluation domains, achieving 96.7% accuracy in knowledge assessment and 17.31/20 (SD 2.65) in clinical case analysis, significantly outperforming other models (P<.001). Human-AI collaboration achieved significant improvements compared with physician-only decision-making, with 16.1% quality enhancement (33.62 vs 28.97; P<.001) and 66.1% time reduction (162.6 s vs 479.2 s; P<.001). System usability was rated favorably (System Usability Scale score=76.8; P=.002), with high acceptance rates (74.25% adoption, 24% modification, and 1.75% rejection). AI assistance provided the greatest benefits in prescription formulation and medication selection (P<.001).

LLMs, particularly DeepSeek-R1, demonstrate substantial capabilities in TCM knowledge assessment and clinical case analysis. Human-AI collaboration significantly enhanced clinical decision-making quality and efficiency while maintaining high physician acceptance. These findings provide compelling evidence for the clinical value of AI-assisted decision-making in TCM, suggesting potential solutions to current challenges in knowledge standardization, clinical training, and health care delivery efficiency. Strategic implementation of AI assistance could significantly enhance the quality, efficiency, and accessibility of TCM care while preserving fundamental principles of individualized treatment.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12954686/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12954686/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/PMC12954686/full.md

---
Source: https://tomesphere.com/paper/PMC12954686