Continuous Training and Fine-tuning for Domain-Specific Language Models in Medical Question Answering
Zhen Guo, Yining Hua

TL;DR
This paper presents a method for rapidly adapting large language models to the Chinese medical domain through continuous training and instruction fine-tuning, achieving GPT-3.5-level performance with fewer computational resources.
Contribution
It introduces a domain-specific training pipeline combining continuous training on medical data and fine-tuning on exam examples, applicable to various specialized fields.
Findings
Model achieves performance comparable to GPT-3.5-turbo.
Significantly reduces computational resource requirements.
Effective for Chinese medical question answering.
Abstract
Large language models exhibit promising general capabilities but often lack specialized knowledge for domain-specific tasks. Developing domain experts from a base model enables a range of applications without prohibitive training costs. This work demonstrates a method using continuous training and instruction fine-tuning to rapidly adapt Llama 2 base models to the Chinese medical domain. We first conduct continuous training on 1B tokens from Chinese medical references to teach relevant vocabulary and knowledge. The models are then fine-tuned on 54K examples sourced from the Chinese National Medical Licensing Examination. Experiments on Chinese medical data confirm the effectiveness of this approach, producing a model comparable to GPT-3.5-turbo while using far fewer computational resources. The resulting domain-specific model could be useful for various Chinese medical applications.
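As a rough illustration of the instruction fine-tuning stage described in the abstract, the sketch below turns one multiple-choice exam item into a training example. It is a minimal sketch under stated assumptions, not the authors' pipeline: the `ExamItem` structure, the prompt template, the character-level `toy_tokenize` stand-in, and the choice to mask prompt tokens out of the loss with `-100` are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass

# Assumption: prompt positions are excluded from the loss using -100,
# a label value many training frameworks ignore by default.
IGNORE_INDEX = -100


@dataclass
class ExamItem:
    """Hypothetical container for one multiple-choice exam question."""
    question: str
    options: dict[str, str]  # option key -> option text
    answer: str              # key of the correct option


def build_prompt(item: ExamItem) -> str:
    """Render the question and its options with an illustrative template."""
    lines = [f"Question: {item.question}"]
    lines += [f"{key}. {text}" for key, text in sorted(item.options.items())]
    lines.append("Answer:")
    return "\n".join(lines)


def toy_tokenize(text: str) -> list[int]:
    """Stand-in tokenizer (one id per character); a real setup would use the model's tokenizer."""
    return [ord(ch) for ch in text]


def make_example(item: ExamItem) -> dict[str, list[int]]:
    """Concatenate prompt and answer ids; only answer positions keep real labels."""
    prompt_ids = toy_tokenize(build_prompt(item))
    answer_ids = toy_tokenize(" " + item.answer)
    return {
        "input_ids": prompt_ids + answer_ids,
        "labels": [IGNORE_INDEX] * len(prompt_ids) + answer_ids,
    }


# Illustrative item, not drawn from the paper's dataset.
item = ExamItem(
    question="Which vitamin deficiency causes scurvy?",
    options={"A": "Vitamin A", "B": "Vitamin C"},
    answer="B",
)
example = make_example(item)
```

In this sketch the model would be trained to predict only the answer tokens. The continuous training stage would instead use plain next-token prediction over raw medical text, with no masking.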
Taxonomy
Topics: Topic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education
Methods: Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Layer · Softmax · Linear Warmup With Cosine Annealing · Dropout
