Poison as Cure: Visual Noise for Mitigating Object Hallucinations in   LVMs

Kejia Zhang; Keda Tao; Jiasheng Tang; Huan Wang

arXiv:2501.19164·cs.CV·February 24, 2025

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs

Kejia Zhang, Keda Tao, Jiasheng Tang, Huan Wang

PDF

Open Access

TL;DR

This paper introduces a novel visual adversarial perturbation method that applies optimized visual noise to large vision-language models, significantly reducing object hallucinations and improving factual accuracy without modifying the models.

Contribution

The paper presents a new adversarial noise technique to mitigate hallucinations in LVMs, enhancing their reliability without altering the underlying models.

Findings

01

Consistently reduces object hallucinations across 8 state-of-the-art LVMs

02

Enhances factual grounding and reduces knowledge bias

03

Validated through extensive experiments

Abstract

Large vision-language models (LVMs) extend large language models (LLMs) with visual perception capabilities, enabling them to process and interpret visual information. A major challenge compromising their reliability is object hallucination that LVMs may generate plausible but factually inaccurate information. We propose a novel visual adversarial perturbation (VAP) method to mitigate this hallucination issue. VAP alleviates LVM hallucination by applying strategically optimized visual noise without altering the base model. Our approach formulates hallucination suppression as an optimization problem, leveraging adversarial strategies to generate beneficial visual perturbations that enhance the model's factual grounding and reduce parametric knowledge bias. Extensive experimental results demonstrate that our method consistently reduces object hallucinations across 8 state-of-the-art LVMs,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNoise Effects and Management · EEG and Brain-Computer Interfaces · Neural dynamics and brain function

MethodsBalanced Selection