Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation
Xiao Wang, Fuling Wang, Haowen Wang, Bo Jiang, Chuanfu Li, Yaowei, Wang, Yonghong Tian, Jin Tang

TL;DR
This paper introduces an associative memory-enhanced model for X-ray report generation that leverages visual and historical report information, significantly improving report quality and achieving state-of-the-art results on multiple datasets.
Contribution
It proposes a novel memory-augmented framework that mimics doctors' report writing by integrating visual region mining and report memory retrieval using Hopfield networks.
Findings
Achieves state-of-the-art performance on IU X-ray, MIMIC-CXR, and Chexpert Plus datasets.
Effectively associates visual regions with diseases for better report accuracy.
Utilizes Hopfield networks for memory retrieval, enhancing report quality.
Abstract
X-ray image based medical report generation achieves significant progress in recent years with the help of the large language model, however, these models have not fully exploited the effective information in visual image regions, resulting in reports that are linguistically sound but insufficient in describing key diseases. In this paper, we propose a novel associative memory-enhanced X-ray report generation model that effectively mimics the process of professional doctors writing medical reports. It considers both the mining of global and local visual information and associates historical report information to better complete the writing of the current report. Specifically, given an X-ray image, we first utilize a classification model along with its activation maps to accomplish the mining of visual regions highly associated with diseases and the learning of disease query tokens.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Semantic Web and Ontologies
