Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities
Shounak Datta, Dhanasekar Sundararaman

TL;DR
This paper introduces CAOS, a comprehensive framework for evaluating object hallucinations in large vision-language models by combining object statistics, semantic analysis, and out-of-domain detection to better understand and quantify hallucination phenomena.
Contribution
CAOS is a novel evaluation method that integrates object statistics, semantic relationships, and out-of-domain detection to analyze hallucinations in LVLMs more thoroughly.
Findings
CAOS effectively detects both in-domain and out-of-domain hallucinated objects.
Semantic analysis reveals reasons behind object hallucinations.
Sequential dynamics influence hallucination patterns.
Abstract
Despite their impressive performance on multi-modal tasks, large vision-language models (LVLMs) tend to suffer from hallucinations. An important type is object hallucination, where LVLMs generate objects that are inconsistent with the images shown to the model. Existing works typically attempt to quantify object hallucinations by detecting and measuring the fraction of hallucinated objects in generated captions. Additionally, more recent work also measures object hallucinations by directly querying the LVLM with binary questions about the presence of likely hallucinated objects based on object statistics like top-k frequent objects and top-k co-occurring objects. In this paper, we present Context-Aware Object Similarities (CAOS), a novel approach for evaluating object hallucination in LVLMs using object statistics as well as the generated captions. CAOS uniquely integrates object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning in Healthcare · Functional Brain Connectivity Studies
MethodsSparse Evolutionary Training
