Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation

Hanyin Wang; Chufan Gao; Bolun Liu; Qiping Xu; Guleid Hussein; Mohamad El Labban; Kingsley Iheasirim; Hariprasad Korsapati; Chuck Outcalt; Jimeng Sun

arXiv:2405.00715·cs.CL·May 28, 2025·2 cites

Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation

Hanyin Wang, Chufan Gao, Bolun Liu, Qiping Xu, Guleid Hussein, Mohamad El Labban, Kingsley Iheasirim, Hariprasad Korsapati, Chuck Outcalt, Jimeng Sun

PDF

Open Access 1 Repo

TL;DR

This paper adapts open-source LLaMA-2 models for expert-level clinical note generation, combining domain-specific training, reinforcement learning, and a new distillation approach, achieving physician-level quality in generated notes.

Contribution

It introduces a comprehensive adaptation process for open-source LLMs to produce high-quality clinical notes, including a novel reinforcement learning method called DistillDirect.

Findings

01

LLaMA-Clinic generates clinically acceptable notes in 92.8% of evaluations.

02

The model matches physician notes in the 'Assessment and Plan' section.

03

Physician ratings show high agreement with expert standards.

Abstract

Proprietary Large Language Models (LLMs) such as GPT-4 and Gemini have demonstrated promising capabilities in clinical text summarization tasks. However, due to patient data privacy concerns and computational costs, many healthcare providers prefer using small, locally-hosted models over external generic LLMs. This study presents a comprehensive domain- and task-specific adaptation process for the open-source LLaMA-2 13 billion parameter model, enabling it to generate high-quality clinical notes from outpatient patient-doctor dialogues. Our process incorporates continued pretraining, supervised fine-tuning, and reinforcement learning from both AI and human feedback. We introduced a new approach, DistillDirect, for performing on-policy reinforcement learning with Gemini 1.0 Pro as the teacher model. Our resulting model, LLaMA-Clinic, can generate clinical notes comparable in quality to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hanyin88/llama-clinic
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Academic Writing and Publishing · Health Sciences Research and Education

MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention