DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching
Xiaofei Huang, Wenting Chen, Jie Liu, Qisheng Lu, Xiaoling Luo and, Linlin Shen

TL;DR
DAMPER is a dual-stage framework that improves medical report generation by aligning image features with MeSH terms and using hypergraph matching to capture detailed semantic relationships, leading to more accurate reports.
Contribution
The paper introduces DAMPER, a novel dual-stage approach that mimics clinical report writing, combining coarse MeSH alignment with fine-grained hypergraph matching for improved report accuracy.
Findings
Outperforms state-of-the-art methods on public datasets.
Effectively captures high-order relationships between image regions and report phrases.
Generates comprehensive and accurate medical reports.
Abstract
Medical report generation is crucial for clinical diagnosis and patient management, summarizing diagnoses and recommendations based on medical imaging. However, existing work often overlook the clinical pipeline involved in report writing, where physicians typically conduct an initial quick review followed by a detailed examination. Moreover, current alignment methods may lead to misaligned relationships. To address these issues, we propose DAMPER, a dual-stage framework for medical report generation that mimics the clinical pipeline of report writing in two stages. In the first stage, a MeSH-Guided Coarse-Grained Alignment (MCG) stage that aligns chest X-ray (CXR) image features with medical subject headings (MeSH) features to generate a rough keyphrase representation of the overall impression. In the second stage, a Hypergraph-Enhanced Fine-Grained Alignment (HFG) stage that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Text and Document Classification Technologies
