Evaluating Hallucination in Large Vision-Language Models based on   Context-Aware Object Similarities

Shounak Datta; Dhanasekar Sundararaman

arXiv:2501.15046·cs.CV·January 28, 2025

Evaluating Hallucination in Large Vision-Language Models based on Context-Aware Object Similarities

Shounak Datta, Dhanasekar Sundararaman

PDF

Open Access

TL;DR

This paper introduces CAOS, a comprehensive framework for evaluating object hallucinations in large vision-language models by combining object statistics, semantic analysis, and out-of-domain detection to better understand and quantify hallucination phenomena.

Contribution

CAOS is a novel evaluation method that integrates object statistics, semantic relationships, and out-of-domain detection to analyze hallucinations in LVLMs more thoroughly.

Findings

01

CAOS effectively detects both in-domain and out-of-domain hallucinated objects.

02

Semantic analysis reveals reasons behind object hallucinations.

03

Sequential dynamics influence hallucination patterns.

Abstract

Despite their impressive performance on multi-modal tasks, large vision-language models (LVLMs) tend to suffer from hallucinations. An important type is object hallucination, where LVLMs generate objects that are inconsistent with the images shown to the model. Existing works typically attempt to quantify object hallucinations by detecting and measuring the fraction of hallucinated objects in generated captions. Additionally, more recent work also measures object hallucinations by directly querying the LVLM with binary questions about the presence of likely hallucinated objects based on object statistics like top-k frequent objects and top-k co-occurring objects. In this paper, we present Context-Aware Object Similarities (CAOS), a novel approach for evaluating object hallucination in LVLMs using object statistics as well as the generated captions. CAOS uniquely integrates object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Machine Learning in Healthcare · Functional Brain Connectivity Studies

MethodsSparse Evolutionary Training