Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Augustin Toma, Patrick R. Lawler, Jimmy Ba, Rahul G. Krishnan, Barry B. Rubin, Bo Wang

TL;DR
Clinical Camel is an open medical language model fine-tuned from LLaMA-2 with QLoRA. It reports state-of-the-art results among openly available medical LLMs and introduces dialogue-based knowledge encoding, a method for synthesizing conversational data from dense medical texts.
Contribution
This work presents Clinical Camel, an open medical LLM that surpasses GPT-3.5 in five-shot evaluations on all assessed benchmarks, together with dialogue-based knowledge encoding, a new method for synthesizing conversational data from dense medical texts.
Findings
Outperforms GPT-3.5 in five-shot evaluations on all assessed medical benchmarks
Achieves 64.3% on the USMLE Sample Exam (GPT-3.5: 58.5%)
Can synthesize plausible clinical notes
Abstract
We present Clinical Camel, an open large language model (LLM) explicitly tailored for clinical research. Fine-tuned from LLaMA-2 using QLoRA, Clinical Camel achieves state-of-the-art performance across medical benchmarks among openly available medical LLMs. Leveraging efficient single-GPU training, Clinical Camel surpasses GPT-3.5 in five-shot evaluations on all assessed benchmarks, including 64.3% on the USMLE Sample Exam (compared to 58.5% for GPT-3.5), 77.9% on PubMedQA (compared to 60.2%), 60.7% on MedQA (compared to 53.6%), and 54.2% on MedMCQA (compared to 51.0%). In addition to these benchmarks, Clinical Camel demonstrates its broader capabilities, such as synthesizing plausible clinical notes. This work introduces dialogue-based knowledge encoding, a novel method to synthesize conversational data from dense medical texts. While benchmark results are encouraging, extensive and…
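The five-shot figures quoted in the abstract can be compared directly. The short sketch below only restates those numbers and computes the percentage-point lead over GPT-3.5 on each benchmark; the names `SCORES` and `margins` are illustrative and do not come from the paper or its code.

```python
# Five-shot accuracies (%) as quoted in the abstract:
# benchmark -> (Clinical Camel, GPT-3.5)
SCORES = {
    "USMLE Sample Exam": (64.3, 58.5),
    "PubMedQA": (77.9, 60.2),
    "MedQA": (60.7, 53.6),
    "MedMCQA": (54.2, 51.0),
}


def margins(scores):
    """Return Clinical Camel's lead over GPT-3.5 in percentage points."""
    return {
        name: round(camel - gpt, 1)
        for name, (camel, gpt) in scores.items()
    }


if __name__ == "__main__":
    # Print benchmarks from the largest lead to the smallest.
    ranked = sorted(margins(SCORES).items(), key=lambda kv: kv[1], reverse=True)
    for name, gap in ranked:
        print(f"{name}: +{gap} points")
```

On these figures the lead is largest on PubMedQA and smallest on MedMCQA.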
Code & Models
- 🤗 augtoma/qCammel-70-x (model) · 1.0k dl · ♡ 28
- 🤗 augtoma/qCammel-13 (model) · 868 dl · ♡ 11
- 🤗 TheBloke/qCammel-70-x-GPTQ (model) · 19 dl · ♡ 2
- 🤗 TheBloke/qCammel-70-x-GGML (model) · 4 dl · ♡ 3
- 🤗 TheBloke/qCammel-13-GPTQ (model) · 29 dl · ♡ 3
- 🤗 TheBloke/qCammel-13-GGML (model) · 6 dl · ♡ 8
- 🤗 wanglab/ClinicalCamel-70B (model) · 2.6k dl · ♡ 49
- 🤗 TheBloke/qCammel-13-GGUF (model) · 271 dl · ♡ 3
- 🤗 TheBloke/qCammel-70-x-GGUF (model) · 268 dl · ♡ 4
- 🤗 TheBloke/qCammel-13-AWQ (model) · 7 dl · ♡ 1
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education · Topic Modeling · Machine Learning in Healthcare
Methods: Multi-Head Attention · Attention Is All You Need · Cosine Annealing · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Attention Dropout
