Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback
Song Yu, Xiaofei Xu, Fangfei Xu, Li Li

TL;DR
This paper presents a framework that enhances large language models' capabilities in Traditional Chinese Medicine by combining supervised fine-tuning with reinforcement learning from AI feedback, achieving significant improvements with limited data.
Contribution
It introduces a novel approach that leverages minimal data and reinforcement learning to improve LLM performance in specialized domains like TCM.
Findings
Significant performance improvement on TCM tasks.
Effective use of small data for domain adaptation.
Both supervised fine-tuning and reinforcement learning contribute to gains.
Abstract
Although large language models perform well in understanding and responding to user intent, their performance in specialized domains such as Traditional Chinese Medicine (TCM) remains limited due to lack of expertise. In addition, high-quality data related to TCM is scarce and difficult to obtain, making large language models ineffective in handling TCM tasks. In this work, we propose a framework to improve the performance of large language models for TCM tasks using only a small amount of data. First, we use medical case data for supervised fine-tuning of the large model, making it initially capable of performing TCM tasks. Subsequently, we further optimize the model's performance using reinforcement learning from AI feedback (RLAIF) to align it with the preference data. The ablation study also demonstrated the performance gain is attributed to both supervised fine-tuning and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraditional Chinese Medicine Studies
MethodsALIGN
