Clinical Camel: An Open Expert-Level Medical Language Model with   Dialogue-Based Knowledge Encoding

Augustin Toma; Patrick R. Lawler; Jimmy Ba; Rahul G. Krishnan; Barry; B. Rubin; Bo Wang

arXiv:2305.12031·cs.CL·August 21, 2023·35 cites

Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding

Augustin Toma, Patrick R. Lawler, Jimmy Ba, Rahul G. Krishnan, Barry, B. Rubin, Bo Wang

PDF

Open Access 2 Repos 10 Models

TL;DR

Clinical Camel is an open, expert-level medical language model fine-tuned from LLaMA-2, achieving state-of-the-art results on medical benchmarks and introducing dialogue-based knowledge encoding for clinical research.

Contribution

This work presents Clinical Camel, a novel open medical LLM with superior benchmark performance and a new dialogue-based knowledge encoding method for clinical data synthesis.

Findings

01

Outperforms GPT-3.5 on multiple medical benchmarks

02

Achieves 64.3% on USMLE Sample Exam

03

Demonstrates capabilities in synthesizing clinical notes

Abstract

We present Clinical Camel, an open large language model (LLM) explicitly tailored for clinical research. Fine-tuned from LLaMA-2 using QLoRA, Clinical Camel achieves state-of-the-art performance across medical benchmarks among openly available medical LLMs. Leveraging efficient single-GPU training, Clinical Camel surpasses GPT-3.5 in five-shot evaluations on all assessed benchmarks, including 64.3% on the USMLE Sample Exam (compared to 58.5% for GPT-3.5), 77.9% on PubMedQA (compared to 60.2%), 60.7% on MedQA (compared to 53.6%), and 54.2% on MedMCQA (compared to 51.0%). In addition to these benchmarks, Clinical Camel demonstrates its broader capabilities, such as synthesizing plausible clinical notes. This work introduces dialogue-based knowledge encoding, a novel method to synthesize conversational data from dense medical texts. While benchmark results are encouraging, extensive and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Machine Learning in Healthcare

Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Attention Dropout