HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs
Sujoy Nath, Arkaprabha Basu, Sharanya Dasgupta, and Swagatam Das

TL;DR
HalluShift++ introduces a novel internal layer analysis method to detect hallucinations in multimodal large language models, improving factual consistency assessment without relying solely on external evaluators.
Contribution
The paper presents a new approach that analyzes internal layer dynamics of MLLMs to detect hallucinations, extending techniques from text-based LLMs to multimodal models.
Findings
Effective detection of hallucinations through internal layer irregularities.
Broadens hallucination assessment from text-only to multimodal models.
Codebase available for implementation and further research.
Abstract
Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in vision-language understanding tasks. While these models often produce linguistically coherent output, they often suffer from hallucinations, generating descriptions that are factually inconsistent with the visual content, potentially leading to adverse consequences. Therefore, the assessment of hallucinations in MLLM has become increasingly crucial in the model development process. Contemporary methodologies predominantly depend on external LLM evaluators, which are themselves susceptible to hallucinations and may present challenges in terms of domain adaptation. In this study, we propose the hypothesis that hallucination manifests as measurable irregularities within the internal layer dynamics of MLLMs, not merely due to distributional shifts but also in the context of layer-wise analysis of specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Misinformation and Its Impacts · Face Recognition and Perception
