Radiology Report Generation for Low-Quality X-Ray Images
Hongze Zhu, Chen Hu, Jiaxuan Jiang, Hong Liu, Yawen Huang, Ming Hu, Tianyu Wang, Zhijian Wu, Yefeng Zheng

TL;DR
This paper introduces a robust radiology report generation framework that maintains performance across varying image qualities by using a quality assessment agent and a dual-loop training strategy.
Contribution
It presents a novel framework with an automated quality assessment and a bi-level optimization training method to improve report generation on low-quality X-ray images.
Findings
Effectively mitigates performance loss on low-quality images
Establishes a new Low-quality Radiology Report Generation benchmark
Demonstrates improved diagnostic feature learning across image qualities
Abstract
Vision-Language Models (VLMs) have significantly advanced automated Radiology Report Generation (RRG). However, existing methods implicitly assume high-quality inputs, overlooking the noise and artifacts prevalent in real-world clinical environments. Consequently, current models exhibit severe performance degradation when processing suboptimal images. To bridge this gap, we propose a robust report generation framework explicitly designed for image quality variations. We first introduce an Automated Quality Assessment Agent (AQAA) to identify low-quality samples within the MIMIC-CXR dataset and establish the Low-quality Radiology Report Generation (LRRG) benchmark. To tackle degradation-induced shifts, we propose a novel Dual-loop Training Strategy leveraging bi-level optimization and gradient consistency. This approach ensures the model learns quality-agnostic diagnostic features by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
