MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data
Tianyu Han, Lisa C. Adams, Jens-Michalis Papaioannou, Paul, Grundmann, Tom Oberhauser, Alexei Figueroa, Alexander L\"oser and, Daniel Truhn, Keno K. Bressem

TL;DR
This paper introduces MedAlpaca, an open-source medical dataset and models designed to enhance medical AI applications while ensuring patient privacy, demonstrating the benefits of fine-tuning LLMs for medical tasks.
Contribution
It provides a large, specialized medical dataset and evaluates the impact of fine-tuning LLMs for medical education and diagnostics.
Findings
Fine-tuned models outperform pre-trained-only models on medical exams.
Open-source models enable privacy-preserving medical AI applications.
Dataset facilitates effective medical-specific language understanding.
Abstract
As large language models (LLMs) like OpenAI's GPT series continue to make strides, we witness the emergence of artificial intelligence applications in an ever-expanding range of fields. In medicine, these LLMs hold considerable promise for improving medical workflows, diagnostics, patient care, and education. Yet, there is an urgent need for open-source models that can be deployed on-premises to safeguard patient privacy. In our work, we present an innovative dataset consisting of over 160,000 entries, specifically crafted to fine-tune LLMs for effective medical applications. We investigate the impact of fine-tuning these datasets on publicly accessible pre-trained LLMs, and subsequently, we juxtapose the performance of pre-trained-only models against the fine-tuned models concerning the examinations that future medical doctors must pass to achieve certification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Cosine Annealing · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Linear Warmup With Cosine Annealing · Dense Connections
