Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language   Models

Lulu Zhao; Weihao Zeng; Xiaofeng Shi; Hua Zhou; Donglin Hao; Yonghua; Lin

arXiv:2406.12182·cs.CL·June 19, 2024·1 cites

Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Models

Lulu Zhao, Weihao Zeng, Xiaofeng Shi, Hua Zhou, Donglin Hao, Yonghua, Lin

PDF

Open Access 1 Models 2 Datasets

TL;DR

Aquila-Med is a bilingual open-source medical language model that leverages continue pre-training, supervised fine-tuning, and reinforcement learning to improve performance across medical tasks and specialties.

Contribution

It introduces a comprehensive training pipeline and high-quality datasets for open-source medical LLMs, advancing performance in medical dialogue and question answering.

Findings

01

Aquila-Med outperforms baseline models in medical dialogue tasks.

02

The model demonstrates high accuracy on medical multiple-choice questions.

03

Open-sourcing datasets and training process benefits the research community.

Abstract

Recently, both closed-source LLMs and open-source communities have made significant strides, outperforming humans in various general domains. However, their performance in specific professional fields such as medicine, especially within the open-source community, remains suboptimal due to the complexity of medical knowledge. We propose Aquila-Med, a bilingual medical LLM based on Aquila, addressing these challenges through continue pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF). We construct a large-scale Chinese and English medical dataset for continue pre-training and a high-quality SFT dataset, covering extensive medical specialties. Additionally, we develop a high-quality Direct Preference Optimization (DPO) dataset for further alignment. Aquila-Med achieves notable results across single-turn, multi-turn dialogues, and medical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
BAAI/AquilaMed-RL
model· 11 dl· ♡ 13
11 dl♡ 13

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies

MethodsShrink and Fine-Tune