EnsemHalDet: Robust VLM Hallucination Detection via Ensemble of Internal State Detectors
Ryuhei Miyazato, Shunsuke Kitada, Kei Harada

TL;DR
EnsemHalDet introduces an ensemble approach leveraging multiple internal VLM representations to improve hallucination detection accuracy and robustness in multimodal tasks.
Contribution
It proposes a novel ensemble framework that combines various internal signals of VLMs for more effective hallucination detection.
Findings
EnsemHalDet outperforms prior methods in AUC across multiple datasets.
Ensembling diverse internal signals enhances detection robustness.
The approach is effective across different VQA datasets and models.
Abstract
Vision-Language Models (VLMs) excel at multimodal tasks, but they remain vulnerable to hallucinations that are factually incorrect or ungrounded in the input image. Recent work suggests that hallucination detection using internal representations is more efficient and accurate than approaches that rely solely on model outputs. However, existing internal-representation-based methods typically rely on a single representation or detector, limiting their ability to capture diverse hallucination signals. In this paper, we propose EnsemHalDet, an ensemble-based hallucination detection framework that leverages multiple internal representations of VLMs, including attention outputs and hidden states. EnsemHalDet trains independent detectors for each representation and combines them through ensemble learning. Experimental results across multiple VQA datasets and VLMs show that EnsemHalDet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
