FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive   Differentiation of Normal and Abnormal Attributes

Kai Shu; Yuzhuo Jia; Ziyang Zhang; Jiechao Gao

arXiv:2409.03947·cs.CV·September 9, 2024

FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes

Kai Shu, Yuzhuo Jia, Ziyang Zhang, Jiechao Gao

PDF

Open Access

TL;DR

FODA-PG introduces a domain-adaptive graph framework for medical image report generation, effectively distinguishing normal and abnormal findings to produce more accurate and clinically relevant radiology reports.

Contribution

The paper presents a novel adaptive partitioning graph model that captures fine-grained semantic differences in medical images, improving report accuracy over existing methods.

Findings

01

Outperforms state-of-the-art on IU-Xray and MIMIC-CXR benchmarks.

02

Effectively differentiates normal and abnormal attributes.

03

Enhances generalization in medical report generation.

Abstract

Automatic Medical Imaging Narrative generation aims to alleviate the workload of radiologists by producing accurate clinical descriptions directly from radiological images. However, the subtle visual nuances and domain-specific terminology in medical images pose significant challenges compared to generic image captioning tasks. Existing approaches often neglect the vital distinction between normal and abnormal findings, leading to suboptimal performance. In this work, we propose FODA-PG, a novel Fine-grained Organ-Disease Adaptive Partitioning Graph framework that addresses these limitations through domain-adaptive learning. FODA-PG constructs a granular graphical representation of radiological findings by separating disease-related attributes into distinct "disease-specific" and "disease-free" categories based on their clinical significance and location. This adaptive partitioning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScientific Computing and Data Management