Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

Junjie Wu; Tsz Ting Chung; Kai Chen; Dit-Yan Yeung

arXiv:2410.23114·cs.CV·July 18, 2025

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

Junjie Wu, Tsz Ting Chung, Kai Chen, Dit-Yan Yeung

PDF

Open Access 1 Repo

TL;DR

This paper introduces a unified framework and benchmark for evaluating both object and relation hallucinations in large vision-language models, revealing that relation hallucination is a significant and overlooked issue.

Contribution

The paper proposes a triplet-based evaluation framework and a new benchmark, Tri-HE, to measure object and relation hallucinations simultaneously in LVLMs, and offers a simple mitigation method.

Findings

01

Relation hallucination is more severe than object hallucination in LVLMs.

02

The proposed framework generalizes across various vision-language tasks.

03

A training-free approach effectively reduces hallucinations.

Abstract

Despite the outstanding performance in vision-language reasoning, Large Vision-Language Models (LVLMs) might generate hallucinated contents that do not exist in the given image. Most existing LVLM hallucination benchmarks are constrained to evaluate the object-related hallucinations. However, the potential hallucination on the relations between two objects, i.e., relation hallucination, still lacks investigation. To remedy that, we design a unified framework to measure the object and relation hallucination in LVLMs simultaneously. The core idea of our framework is to evaluate hallucinations via (object, relation, object) triplets extracted from LVLMs' responses, making it easily generalizable to different vision-language tasks. Based on our framework, we further introduce Tri-HE, a novel Triplet-level Hallucination Evaluation benchmark which can be used to study both object and relation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wujunjie1998/tri-he
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification