FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes
Kai Shu, Yuzhuo Jia, Ziyang Zhang, Jiechao Gao

TL;DR
FODA-PG introduces a domain-adaptive graph framework for medical image report generation, effectively distinguishing normal and abnormal findings to produce more accurate and clinically relevant radiology reports.
Contribution
The paper presents a novel adaptive partitioning graph model that captures fine-grained semantic differences in medical images, improving report accuracy over existing methods.
Findings
Outperforms state-of-the-art on IU-Xray and MIMIC-CXR benchmarks.
Effectively differentiates normal and abnormal attributes.
Enhances generalization in medical report generation.
Abstract
Automatic Medical Imaging Narrative generation aims to alleviate the workload of radiologists by producing accurate clinical descriptions directly from radiological images. However, the subtle visual nuances and domain-specific terminology in medical images pose significant challenges compared to generic image captioning tasks. Existing approaches often neglect the vital distinction between normal and abnormal findings, leading to suboptimal performance. In this work, we propose FODA-PG, a novel Fine-grained Organ-Disease Adaptive Partitioning Graph framework that addresses these limitations through domain-adaptive learning. FODA-PG constructs a granular graphical representation of radiological findings by separating disease-related attributes into distinct "disease-specific" and "disease-free" categories based on their clinical significance and location. This adaptive partitioning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management
