Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback

Song Yu; Xiaofei Xu; Fangfei Xu; Li Li

arXiv:2411.00897 · cs.CL · November 5, 2024 · 2 citations

Open Access

TL;DR

This paper presents a framework that enhances large language models' capabilities in Traditional Chinese Medicine by combining supervised fine-tuning with reinforcement learning from AI feedback, achieving significant improvements with limited data.

Contribution

The paper introduces a two-stage approach, supervised fine-tuning on medical case data followed by reinforcement learning from AI feedback (RLAIF), that improves LLM performance on TCM tasks using only a small amount of data.

Findings

1. Significant performance improvement on TCM tasks.

2. Effective use of small data for domain adaptation.

3. Both supervised fine-tuning and reinforcement learning contribute to gains.

Abstract

Although large language models perform well in understanding and responding to user intent, their performance in specialized domains such as Traditional Chinese Medicine (TCM) remains limited due to a lack of domain expertise. In addition, high-quality TCM data is scarce and difficult to obtain, which makes large language models ineffective at handling TCM tasks. In this work, we propose a framework to improve the performance of large language models on TCM tasks using only a small amount of data. First, we use medical case data for supervised fine-tuning of the large model, giving it an initial ability to perform TCM tasks. Subsequently, we further optimize the model's performance using reinforcement learning from AI feedback (RLAIF) to align it with the preference data. The ablation study also demonstrates that the performance gain is attributable to both supervised fine-tuning and the…
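The abstract does not say how the preference data for the RLAIF stage is built, so the following is only a minimal sketch of one common pattern: sample several candidate answers from the fine-tuned model, score each with an AI judge, and keep the best and worst as a (chosen, rejected) pair. Every name here (`PreferencePair`, `build_preference_pairs`, `toy_judge`, `min_margin`) is hypothetical, and the keyword-overlap judge is a stand-in for a real AI feedback model, not the authors' method.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class PreferencePair:
    """One preference example: the same prompt with a preferred and a dispreferred answer."""
    prompt: str
    chosen: str
    rejected: str


def build_preference_pairs(
    prompt: str,
    candidates: List[str],
    judge: Callable[[str, str], float],
    min_margin: float = 0.0,
) -> List[PreferencePair]:
    """Score candidates with an AI judge and pair the best with the worst.

    Returns an empty list when there are fewer than two candidates or when
    the score gap is not larger than `min_margin` (no clear preference).
    """
    if len(candidates) < 2:
        return []
    # Sort candidates from highest to lowest judge score.
    scored = sorted(
        ((judge(prompt, c), c) for c in candidates),
        key=lambda pair: pair[0],
        reverse=True,
    )
    best_score, best = scored[0]
    worst_score, worst = scored[-1]
    if best_score - worst_score <= min_margin:
        return []
    return [PreferencePair(prompt=prompt, chosen=best, rejected=worst)]


def toy_judge(reference_herbs: List[str]) -> Callable[[str, str], float]:
    """Hypothetical stand-in for an AI judge: fraction of reference herbs mentioned."""
    def judge(prompt: str, response: str) -> float:
        hits = sum(1 for herb in reference_herbs if herb in response)
        return hits / len(reference_herbs)
    return judge


if __name__ == "__main__":
    judge = toy_judge(["mahuang", "guizhi"])
    pairs = build_preference_pairs(
        "Patient presents with chills and no sweating. Suggest a prescription.",
        ["mahuang, guizhi, xingren", "renshen"],
        judge,
    )
    for pair in pairs:
        print("chosen:", pair.chosen, "| rejected:", pair.rejected)
```

Pairs in this shape could then be passed to whichever preference-optimization procedure is used; the `min_margin` filter is a design choice that drops prompts where the judge shows no clear preference between candidates.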

Taxonomy

Topics: Traditional Chinese Medicine Studies

Methods: ALIGN