V-Loop: Visual Logical Loop Verification for Hallucination Detection in Medical Visual Question Answering
Mengyuan Jin, Zehui Liao, Yong Xia

TL;DR
V-Loop is a training-free, plug-and-play framework that verifies the factual correctness of answers in medical visual question answering by establishing a visually grounded logical loop, effectively detecting hallucinations in high-stakes medical scenarios.
Contribution
It introduces a novel visual logical loop verification method that directly checks answer factuality without training, improving hallucination detection in medical VQA systems.
Findings
V-Loop outperforms existing methods on multiple benchmarks.
It enhances the accuracy of hallucination detection in medical VQA.
V-Loop is efficient and complements uncertainty-based approaches.
Abstract
Multimodal Large Language Models (MLLMs) have shown remarkable capability in assisting disease diagnosis in medical visual question answering (VQA). However, their outputs remain vulnerable to hallucinations (i.e., responses that contradict visual facts), posing significant risks in high-stakes medical scenarios. Recent introspective detection methods, particularly uncertainty-based approaches, offer computational efficiency but are fundamentally indirect, as they estimate predictive uncertainty for an image-question pair rather than verifying the factual correctness of a specific answer. To address this limitation, we propose Visual Logical Loop Verification (V-Loop), a training-free and plug-and-play framework for hallucination detection in medical VQA. V-Loop introduces a bidirectional reasoning process that forms a visually grounded logical loop to verify factual correctness. Given…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
