Continuous Training and Fine-tuning for Domain-Specific Language Models   in Medical Question Answering

Zhen Guo; Yining Hua

arXiv:2311.00204·cs.CL·November 2, 2023·1 cites

Continuous Training and Fine-tuning for Domain-Specific Language Models in Medical Question Answering

Zhen Guo, Yining Hua

PDF

Open Access

TL;DR

This paper presents a method for rapidly adapting large language models to the Chinese medical domain through continuous training and instruction fine-tuning, achieving GPT-3.5-level performance with less resources.

Contribution

It introduces a domain-specific training pipeline combining continuous training on medical data and fine-tuning on exam examples, applicable to various specialized fields.

Findings

01

Model achieves performance comparable to GPT-3.5-turbo.

02

Significantly reduces computational resource requirements.

03

Effective for Chinese medical question answering.

Abstract

Large language models exhibit promising general capabilities but often lack specialized knowledge for domain-specific tasks. Developing domain experts from a base model enables a range of applications without prohibitive training costs. This work demonstrates a method using continuous training and instruction fine-tuning to rapidly adapt Llama 2 base models to the Chinese medical domain. We first conduct continuous training on 1B tokens from Chinese medical references to teach relevant vocabulary and knowledge. The models are then fine-tuned on 54K examples sourced from the Chinese National Medical Licensing Examination. Experiments on Chinese medical data confirm the effectiveness of this approach, producing a model comparable to GPT-3.5-turbo while using way less computational resource. The resulting domain-specific model could be useful for various Chinese medical applications. More…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Linear Layer · Softmax · Linear Warmup With Cosine Annealing · Dropout