Alleviating Hallucinations in Large Vision-Language Models through   Hallucination-Induced Optimization

Xinyu Lyu; Beitao Chen; Lianli Gao; Jingkuan Song; Heng Tao Shen

arXiv:2405.15356·cs.CV·April 2, 2025·1 cites

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization

Xinyu Lyu, Beitao Chen, Lianli Gao, Jingkuan Song, Heng Tao Shen

PDF

Open Access 1 Repo

TL;DR

This paper introduces Hallucination-Induced Optimization (HIO), a novel method that reduces hallucinations in large vision-language models by amplifying contrast between hallucinated and accurate tokens, backed by theoretical analysis and extensive experiments.

Contribution

The paper proposes a new optimization strategy, HIO, that enhances contrast decoding in LVLMs to effectively mitigate hallucinations, improving upon existing methods.

Findings

01

HIO significantly reduces hallucinations in LVLMs.

02

HIO outperforms state-of-the-art methods across multiple benchmarks.

03

Theoretical analysis supports the effectiveness of contrast amplification in hallucination mitigation.

Abstract

Although Large Visual Language Models (LVLMs) have demonstrated exceptional abilities in understanding multimodal data, they invariably suffer from hallucinations, leading to a disconnect between the generated text and the corresponding images. Almost all current visual contrastive decoding methods attempt to mitigate these hallucinations by introducing visual uncertainty information that appropriately widens the contrastive logits gap between hallucinatory and targeted ones. However, due to uncontrollable nature of the global visual uncertainty, they struggle to precisely induce the hallucinatory tokens, which severely limits their effectiveness in mitigating hallucinations and may even lead to the generation of undesired hallucinations. To tackle this issue, we conducted the theoretical analysis to promote the effectiveness of contrast decoding. Building on this insight, we introduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BT-C/HIO
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEpilepsy research and treatment · Brain Tumor Detection and Classification · Machine Learning in Healthcare