PMC-LLaMA: Towards Building Open-source Language Models for Medicine
Chaoyi Wu, Weixiong Lin, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi, Xie

TL;DR
This paper presents PMC-LLaMA, an open-source medical language model built through domain-specific data integration and fine-tuning, achieving superior performance on medical question-answering benchmarks with a 13-billion-parameter model.
Contribution
It introduces a systematic approach for adapting general language models to the medical domain, including data collection, fine-tuning, and evaluation, with a new large-scale medical instruction dataset.
Findings
PMC-LLaMA outperforms existing models on medical QA benchmarks.
The model surpasses ChatGPT in medical question-answering tasks.
A comprehensive dataset of 202M tokens was created for instruction tuning.
Abstract
Recently, Large Language Models (LLMs) have showcased remarkable capabilities in natural language understanding. While demonstrating proficiency in everyday conversations and question-answering situations, these models frequently struggle in domains that require precision, such as medical applications, due to their lack of domain-specific knowledge. In this paper, we describe the procedure for building a powerful, open-source language model specifically designed for medicine applications, termed as PMC-LLaMA. Our contributions are threefold: (i) we systematically investigate the process of adapting a general-purpose foundation language model towards medical domain, this involves data-centric knowledge injection through the integration of 4.8M biomedical academic papers and 30K medical textbooks, as well as comprehensive fine-tuning for alignment with domain-specific instructions; (ii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education
