Knowledge Matters: Radiology Report Generation with General and Specific Knowledge
Shuxin Yang, Xian Wu, Shen Ge, Shaohua Kevin Zhou, Li Xiao

TL;DR
This paper introduces a knowledge-enhanced approach for radiology report generation that integrates general and specific medical knowledge with visual features, improving report quality over existing image captioning methods.
Contribution
It proposes a novel multi-head attention mechanism to effectively combine visual features with both general and specific medical knowledge for better report generation.
Findings
Outperforms state-of-the-art methods on IU-Xray and MIMIC-CXR datasets.
Both general and specific knowledge contribute to performance improvements.
Ablation studies confirm the effectiveness of the knowledge integration.
Abstract
Automatic radiology report generation is critical in clinics which can relieve experienced radiologists from the heavy workload and remind inexperienced radiologists of misdiagnosis or missed diagnose. Existing approaches mainly formulate radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such pure data-driven approaches suffer from the following problems: 1) visual and textual bias problem; 2) lack of expert knowledge. In this paper, we propose a knowledge-enhanced radiology report generation approach introduces two types of medical knowledge: 1) General knowledge, which is input independent and provides the broad knowledge for report generation; 2) Specific knowledge, which is input dependent and provides the fine-grained knowledge for report generation. To fully utilize both the general and specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling
MethodsSoftmax · Linear Layer
