The current status of large language models in summarizing radiology report impressions
Danqing Hu, Shanyuan Zhang, Qing Liu, Xiaofeng Zhu, Bing Liu

TL;DR
This study evaluates the ability of eight large language models to summarize radiology report impressions across different report types, highlighting current limitations and the impact of few-shot prompting on performance.
Contribution
It provides a comprehensive assessment of LLMs in radiology impression summarization, introducing human evaluation metrics and analyzing the effects of prompt strategies.
Findings
LLMs perform comparably in completeness and correctness.
Few-shot prompts improve conciseness and verisimilitude.
LLMs cannot yet replace radiologists in impression summarization.
Abstract
Large language models (LLMs) like ChatGPT show excellent capabilities in various natural language processing tasks, especially for text generation. The effectiveness of LLMs in summarizing radiology report impressions remains unclear. In this study, we explore the capability of eight LLMs on the radiology report impression summarization. Three types of radiology reports, i.e., CT, PET-CT, and Ultrasound reports, are collected from Peking University Cancer Hospital and Institute. We use the report findings to construct the zero-shot, one-shot, and three-shot prompts with complete example reports to generate the impressions. Besides the automatic quantitative evaluation metrics, we define five human evaluation metrics, i.e., completeness, correctness, conciseness, verisimilitude, and replaceability, to evaluate the semantics of the generated impressions. Two thoracic surgeons (ZSY and LB)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Radiology practices and education · Artificial Intelligence in Healthcare and Education
