Structuring Radiology Reports: Challenging LLMs with Lightweight Models

Johannes Moll; Louisa Fay; Asfandyar Azhar; Sophie Ostmeier; Tim Lueth; Sergios Gatidis; Curtis Langlotz; Jean-Benoit Delbrouck

arXiv:2506.00200·cs.CL·July 15, 2025

Structuring Radiology Reports: Challenging LLMs with Lightweight Models

Johannes Moll, Louisa Fay, Asfandyar Azhar, Sophie Ostmeier, Tim Lueth, Sergios Gatidis, Curtis Langlotz, Jean-Benoit Delbrouck

PDF

Open Access 7 Models

TL;DR

This paper demonstrates that lightweight encoder-decoder models can effectively structure radiology reports, outperforming larger LLMs in efficiency, privacy, and resource use, making them suitable for clinical applications.

Contribution

The study introduces and benchmarks lightweight models like T5 and BERT2BERT for radiology report structuring, showing they can outperform larger LLMs with less computational cost.

Findings

01

Lightweight models outperform large LLMs with prompt-based adaptation.

02

LoRA finetuning yields modest gains but requires significantly more resources.

03

Lightweight models are more sustainable and privacy-preserving for clinical text processing.

Abstract

Radiology reports are critical for clinical decision-making but often lack a standardized format, limiting both human interpretability and machine learning (ML) applications. While large language models (LLMs) have shown strong capabilities in reformatting clinical text, their high computational requirements, lack of transparency, and data privacy concerns hinder practical deployment. To address these challenges, we explore lightweight encoder-decoder models (<300M parameters)-specifically T5 and BERT2BERT-for structuring radiology reports from the MIMIC-CXR and CheXpert Plus datasets. We benchmark these models against eight open-source LLMs (1B-70B), adapted using prefix prompting, in-context learning (ICL), and low-rank adaptation (LoRA) finetuning. Our best-performing lightweight model outperforms all LLMs adapted using prompt-based techniques on a human-annotated test set. While…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Radiology practices and education · Radiomics and Machine Learning in Medical Imaging