Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
Xinyu Lyu, Beitao Chen, Lianli Gao, Jingkuan Song, Heng Tao Shen

TL;DR
This paper introduces Hallucination-Induced Optimization (HIO), a novel method that reduces hallucinations in large vision-language models by amplifying contrast between hallucinated and accurate tokens, backed by theoretical analysis and extensive experiments.
Contribution
The paper proposes a new optimization strategy, HIO, that enhances contrast decoding in LVLMs to effectively mitigate hallucinations, improving upon existing methods.
Findings
HIO significantly reduces hallucinations in LVLMs.
HIO outperforms state-of-the-art methods across multiple benchmarks.
Theoretical analysis supports the effectiveness of contrast amplification in hallucination mitigation.
Abstract
Although Large Visual Language Models (LVLMs) have demonstrated exceptional abilities in understanding multimodal data, they invariably suffer from hallucinations, leading to a disconnect between the generated text and the corresponding images. Almost all current visual contrastive decoding methods attempt to mitigate these hallucinations by introducing visual uncertainty information that appropriately widens the contrastive logits gap between hallucinatory and targeted ones. However, due to uncontrollable nature of the global visual uncertainty, they struggle to precisely induce the hallucinatory tokens, which severely limits their effectiveness in mitigating hallucinations and may even lead to the generation of undesired hallucinations. To tackle this issue, we conducted the theoretical analysis to promote the effectiveness of contrast decoding. Building on this insight, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEpilepsy research and treatment · Brain Tumor Detection and Classification · Machine Learning in Healthcare
