TRACE: Temporal Radiology with Anatomical Change Explanation for Grounded X-ray Report Generation
OFM Riaz Rahman Aranya, Kevin Desai

TL;DR
TRACE is a novel model that jointly performs temporal comparison, change classification, and spatial localization in chest X-ray analysis, enabling detailed and explainable change detection over time.
Contribution
It introduces the first model to combine temporal comparison, change classification, and spatial grounding in radiology report generation.
Findings
Achieves over 90% grounding accuracy.
Joint learning of comparison and grounding enables change detection.
Grounding acts as a spatial attention mechanism for temporal reasoning.
Abstract
Temporal comparison of chest X-rays is fundamental to clinical radiology, enabling detection of disease progression, treatment response, and new findings. While vision-language models have advanced single-image report generation and visual grounding, no existing method combines these capabilities for temporal change detection. We introduce Temporal Radiology with Anatomical Change Explanation (TRACE), the first model that jointly performs temporal comparison, change classification, and spatial localization. Given a prior and current chest X-ray, TRACE generates natural language descriptions of interval changes (worsened, improved, stable) while grounding each finding with bounding box coordinates. TRACE demonstrates effective spatial localization with over 90% grounding accuracy, establishing a foundation for this challenging new task. Our ablation study uncovers an emergent capability:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · COVID-19 diagnosis using AI
