Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMs

Xiao Liang; Chenxi Liu; Zhi Ma; Di Wang; Bin Jing; Quan Wang; Yuanyuan Shi

arXiv:2512.17189·cs.CV·December 22, 2025

Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMs

Xiao Liang, Chenxi Liu, Zhi Ma, Di Wang, Bin Jing, Quan Wang, Yuanyuan Shi

PDF

Open Access 1 Video

TL;DR

This paper introduces ARCD, a plug-and-play method that uses anatomical region guidance to reduce hallucinations in medical vision-language models, improving their reliability and diagnostic accuracy across various medical imaging modalities.

Contribution

We propose a novel anatomical region-guided contrastive decoding approach that effectively mitigates hallucinations in MedVLMs without requiring costly retraining or annotations.

Findings

01

Significant reduction in hallucinations across multiple datasets.

02

Improved regional understanding and diagnostic accuracy.

03

Effective in diverse medical imaging modalities.

Abstract

Medical Vision-Language Models (MedVLMs) show immense promise in clinical applicability. However, their reliability is hindered by hallucinations, where models often fail to derive answers from visual evidence, instead relying on learned textual priors. Existing mitigation strategies for MedVLMs have distinct limitations: training-based methods rely on costly expert annotations, limiting scalability, while training-free interventions like contrastive decoding, though data-efficient, apply a global, untargeted correction whose effects in complex real-world clinical settings can be unreliable. To address these challenges, we introduce Anatomical Region-Guided Contrastive Decoding (ARCD), a plug-and-play strategy that mitigates hallucinations by providing targeted, region-specific guidance. Our module leverages an anatomical mask to direct a three-tiered contrastive decoding process. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMs· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Adversarial Robustness in Machine Learning