CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base
Cong-Duy Nguyen, Xiaobao Wu, Duc Anh Vu, Shuai Zhao, Thong Nguyen, Anh Tuan Luu

TL;DR
CutPaste&Find is a lightweight, training-free multimodal hallucination detection framework that uses a visual-aid knowledge base and off-the-shelf modules to efficiently identify non-existent objects in LVLM outputs.
Contribution
It introduces a novel, efficient, and training-free detection method leveraging a visual-aid knowledge base, reducing reliance on expensive API calls and iterative validation.
Findings
Achieves competitive detection performance on benchmark datasets.
Significantly more efficient and cost-effective than existing methods.
Operates without requiring LVLM inference or training.
Abstract
Large Vision-Language Models (LVLMs) have demonstrated impressive multimodal reasoning capabilities, but they remain susceptible to hallucination, particularly object hallucination where non-existent objects or incorrect attributes are fabricated in generated descriptions. Existing detection methods achieve strong performance but rely heavily on expensive API calls and iterative LVLM-based validation, making them impractical for large-scale or offline use. To address these limitations, we propose CutPaste\&Find, a lightweight and training-free framework for detecting hallucinations in LVLM-generated outputs. Our approach leverages off-the-shelf visual and linguistic modules to perform multi-step verification efficiently without requiring LVLM inference. At the core of our framework is a Visual-aid Knowledge Base that encodes rich entity-attribute relationships and associated image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychedelics and Drug Studies · Data Visualization and Analytics · Hallucinations in medical conditions
MethodsBalanced Selection
