Structuring Radiology Reports: Challenging LLMs with Lightweight Models
Johannes Moll, Louisa Fay, Asfandyar Azhar, Sophie Ostmeier, Tim Lueth, Sergios Gatidis, Curtis Langlotz, Jean-Benoit Delbrouck

TL;DR
This paper demonstrates that lightweight encoder-decoder models can effectively structure radiology reports, outperforming larger LLMs in efficiency, privacy, and resource use, making them suitable for clinical applications.
Contribution
The study introduces and benchmarks lightweight models like T5 and BERT2BERT for radiology report structuring, showing they can outperform larger LLMs with less computational cost.
Findings
Lightweight models outperform large LLMs with prompt-based adaptation.
LoRA finetuning yields modest gains but requires significantly more resources.
Lightweight models are more sustainable and privacy-preserving for clinical text processing.
Abstract
Radiology reports are critical for clinical decision-making but often lack a standardized format, limiting both human interpretability and machine learning (ML) applications. While large language models (LLMs) have shown strong capabilities in reformatting clinical text, their high computational requirements, lack of transparency, and data privacy concerns hinder practical deployment. To address these challenges, we explore lightweight encoder-decoder models (<300M parameters)-specifically T5 and BERT2BERT-for structuring radiology reports from the MIMIC-CXR and CheXpert Plus datasets. We benchmark these models against eight open-source LLMs (1B-70B), adapted using prefix prompting, in-context learning (ICL), and low-rank adaptation (LoRA) finetuning. Our best-performing lightweight model outperforms all LLMs adapted using prompt-based techniques on a human-annotated test set. While…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗StanfordAIMI/SRR-BERT2BERT-RadBERTmodel· 5 dl· ♡ 15 dl♡ 1
- 🤗StanfordAIMI/SRR-BERT2BERT-RoBERTa-biomedmodel· 3 dl· ♡ 13 dl♡ 1
- 🤗StanfordAIMI/SRR-BERT2BERT-RoBERTa-basemodel· 8 dl8 dl
- 🤗StanfordAIMI/SRR-T5-Flanmodel· 29 dl29 dl
- 🤗StanfordAIMI/SRR-T5-Basemodel· 80 dl80 dl
- 🤗StanfordAIMI/SRR-T5-SciFivemodel· 7 dl· ♡ 17 dl♡ 1
- 🤗StanfordAIMI/SRR-BERT2BERT-RoBERTa-PM-M3model· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Radiology practices and education · Radiomics and Machine Learning in Medical Imaging
