Radiology Report Generation with Layer-Wise Anatomical Attention
Emmanuel D. Mu\~niz-De-Le\'on, Jorge A. Rosales-de-Golferichs, Ana S. Mu\~noz-Rodr\'iguez, Alejandro I. Trejo-Castro, Eduardo de Avila-Armenta, Antonio Mart\'inez-Torteya

TL;DR
This paper presents a compact, image-only model for generating chest X-ray reports that uses anatomical attention to improve clinical relevance and coherence, achieving significant performance gains on standard datasets.
Contribution
The authors introduce a novel layer-wise anatomical attention mechanism in a compact image-to-text model, enhancing report accuracy without large-scale multimodal training.
Findings
CheXpert Macro-F1 increased by 168%
RadGraph F1 improved by 9.7%
Model outperforms existing methods on key metrics
Abstract
Automatic radiology report generation is a promising application of multimodal deep learning, aiming to reduce reporting workload and improve consistency. However, current state-of-the-art (SOTA) systems - such as Multimodal AI for Radiology Applications (MAIRA-2) and Medical Pathways Language Model-Multimodal (MedPaLM-M) - depend on large-scale multimodal training, clinical metadata, and multiple imaging views, making them resource-intensive and inaccessible for most settings. We introduce a compact image-to-text architecture that generates the Findings section of chest X-ray reports from a single frontal image. The model combines a frozen Self-Distillation with No Labels v3 (DINOv3) Vision Transformer (ViT) encoder with a Generative Pre-trained Transformer 2 (GPT-2) decoder enhanced by layer-wise anatomical attention. This mechanism integrates lung and heart segmentation masks through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Radiology practices and education
