MedAlpaca -- An Open-Source Collection of Medical Conversational AI   Models and Training Data

Tianyu Han; Lisa C. Adams; Jens-Michalis Papaioannou; Paul; Grundmann; Tom Oberhauser; Alexei Figueroa; Alexander L\"oser and; Daniel Truhn; Keno K. Bressem

arXiv:2304.08247·cs.CL·March 20, 2025·105 cites

MedAlpaca -- An Open-Source Collection of Medical Conversational AI Models and Training Data

Tianyu Han, Lisa C. Adams, Jens-Michalis Papaioannou, Paul, Grundmann, Tom Oberhauser, Alexei Figueroa, Alexander L\"oser and, Daniel Truhn, Keno K. Bressem

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces MedAlpaca, an open-source medical dataset and models designed to enhance medical AI applications while ensuring patient privacy, demonstrating the benefits of fine-tuning LLMs for medical tasks.

Contribution

It provides a large, specialized medical dataset and evaluates the impact of fine-tuning LLMs for medical education and diagnostics.

Findings

01

Fine-tuned models outperform pre-trained-only models on medical exams.

02

Open-source models enable privacy-preserving medical AI applications.

03

Dataset facilitates effective medical-specific language understanding.

Abstract

As large language models (LLMs) like OpenAI's GPT series continue to make strides, we witness the emergence of artificial intelligence applications in an ever-expanding range of fields. In medicine, these LLMs hold considerable promise for improving medical workflows, diagnostics, patient care, and education. Yet, there is an urgent need for open-source models that can be deployed on-premises to safeguard patient privacy. In our work, we present an innovative dataset consisting of over 160,000 entries, specifically crafted to fine-tune LLMs for effective medical applications. We investigate the impact of fine-tuning these datasets on publicly accessible pre-trained LLMs, and subsequently, we juxtapose the performance of pre-trained-only models against the fine-tuned models concerning the examinations that future medical doctors must pass to achieve certification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tuneinsight/federated-llms
pytorch

Datasets

flwrlabs/medical-meadow-medical-flashcards
dataset· 980 dl
980 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Residual Connection · Cosine Annealing · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Linear Warmup With Cosine Annealing · Dense Connections