R2GenCSR: Mining Contextual and Residual Information for LLMs-based Radiology Report Generation
Xiao Wang, Yuehang Li, Fuling Wang, Shiao Wang, Chuanfu Li, Bo Jiang

TL;DR
This paper introduces R2GenCSR, a novel framework that enhances radiology report generation by mining contextual and residual information using an efficient vision backbone and context retrieval, leading to improved performance and reduced complexity.
Contribution
The paper proposes a new context-guided framework with a linear complexity vision backbone and context retrieval strategy, improving report generation quality and efficiency.
Findings
Effective use of Mamba backbone with linear complexity
Context retrieval improves feature representation
Validated on three X-ray datasets
Abstract
Inspired by the tremendous success of Large Language Models (LLMs), existing Radiology report generation methods attempt to leverage large models to achieve better performance. They usually adopt a Transformer to extract the visual features of a given X-ray image, and then, feed them into the LLM for text generation. How to extract more effective information for the LLMs to help them improve final results is an urgent problem that needs to be solved. Additionally, the use of visual Transformer models also brings high computational complexity. To address these issues, this paper proposes a novel context-guided efficient radiology report generation framework. Specifically, we introduce the Mamba as the vision backbone with linear complexity, and the performance obtained is comparable to that of the strong Transformer model. More importantly, we perform context retrieval from the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Radiomics and Machine Learning in Medical Imaging
MethodsSparse Evolutionary Training · Linear Layer · Residual Connection · Layer Normalization · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Attention Is All You Need · Byte Pair Encoding · Absolute Position Encodings
