PMC-LLaMA: Towards Building Open-source Language Models for Medicine

Chaoyi Wu; Weixiong Lin; Xiaoman Zhang; Ya Zhang; Yanfeng Wang; Weidi; Xie

arXiv:2304.14454·cs.CL·August 28, 2023·56 cites

PMC-LLaMA: Towards Building Open-source Language Models for Medicine

Chaoyi Wu, Weixiong Lin, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi, Xie

PDF

Open Access 1 Repo

TL;DR

This paper presents PMC-LLaMA, an open-source medical language model built through domain-specific data integration and fine-tuning, achieving superior performance on medical question-answering benchmarks with a 13-billion-parameter model.

Contribution

It introduces a systematic approach for adapting general language models to the medical domain, including data collection, fine-tuning, and evaluation, with a new large-scale medical instruction dataset.

Findings

01

PMC-LLaMA outperforms existing models on medical QA benchmarks.

02

The model surpasses ChatGPT in medical question-answering tasks.

03

A comprehensive dataset of 202M tokens was created for instruction tuning.

Abstract

Recently, Large Language Models (LLMs) have showcased remarkable capabilities in natural language understanding. While demonstrating proficiency in everyday conversations and question-answering situations, these models frequently struggle in domains that require precision, such as medical applications, due to their lack of domain-specific knowledge. In this paper, we describe the procedure for building a powerful, open-source language model specifically designed for medicine applications, termed as PMC-LLaMA. Our contributions are threefold: (i) we systematically investigate the process of adapting a general-purpose foundation language model towards medical domain, this involves data-centric knowledge injection through the integration of 4.8M biomedical academic papers and 30K medical textbooks, as well as comprehensive fine-tuning for alignment with domain-specific instructions; (ii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chaoyi-wu/pmc-llama
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education