The current status of large language models in summarizing radiology   report impressions

Danqing Hu; Shanyuan Zhang; Qing Liu; Xiaofeng Zhu; Bing Liu

arXiv:2406.02134·cs.CL·April 7, 2025·1 cites

The current status of large language models in summarizing radiology report impressions

Danqing Hu, Shanyuan Zhang, Qing Liu, Xiaofeng Zhu, Bing Liu

PDF

Open Access

TL;DR

This study evaluates the ability of eight large language models to summarize radiology report impressions across different report types, highlighting current limitations and the impact of few-shot prompting on performance.

Contribution

It provides a comprehensive assessment of LLMs in radiology impression summarization, introducing human evaluation metrics and analyzing the effects of prompt strategies.

Findings

01

LLMs perform comparably in completeness and correctness.

02

Few-shot prompts improve conciseness and verisimilitude.

03

LLMs cannot yet replace radiologists in impression summarization.

Abstract

Large language models (LLMs) like ChatGPT show excellent capabilities in various natural language processing tasks, especially for text generation. The effectiveness of LLMs in summarizing radiology report impressions remains unclear. In this study, we explore the capability of eight LLMs on the radiology report impression summarization. Three types of radiology reports, i.e., CT, PET-CT, and Ultrasound reports, are collected from Peking University Cancer Hospital and Institute. We use the report findings to construct the zero-shot, one-shot, and three-shot prompts with complete example reports to generate the impressions. Besides the automatic quantitative evaluation metrics, we define five human evaluation metrics, i.e., completeness, correctness, conciseness, verisimilitude, and replaceability, to evaluate the semantics of the generated impressions. Two thoracic surgeons (ZSY and LB)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · Radiology practices and education · Artificial Intelligence in Healthcare and Education