Hallucination Mitigating for Medical Report Generation
Ruoqing Zhao, Runze Xia, Piji Li

TL;DR
This paper presents KERM, a framework that reduces hallucinations in medical report generation by integrating knowledge retrieval, contextual purification, and fine-grained rewards, improving report accuracy and relevance.
Contribution
The work introduces a novel knowledge-enhanced framework with a purification module and reward-guided training to mitigate hallucinations in medical report generation.
Findings
Significant reduction in hallucinations on IU-Xray and MIMIC-CXR datasets.
Enhanced clinical relevance and supportiveness of generated reports.
Improved alignment with medical knowledge and context.
Abstract
In the realm of medical report generation (MRG), the integration of natural language processing has emerged as a vital tool to alleviate the workload of radiologists. Despite the impressive capabilities demonstrated by large vision language models (LVLMs) in understanding natural language, their susceptibility to generating plausible yet inaccurate claims, known as ``hallucinations'', raises concerns-especially in the nuanced and critical field of medical. In this work, we introduce a framework, \textbf{K}nowledge-\textbf{E}nhanced with Fine-Grained \textbf{R}einforced Rewards \textbf{M}edical Report Generation (KERM), to tackle the issue. Our approach refines the input to the LVLM by first utilizing MedCLIP for knowledge retrieval, incorporating relevant lesion fact sentences from a curated knowledge corpus. We then introduce a novel purification module to ensure the retrieved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Misinformation and Its Impacts
