Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs

Jiayu Hu; Beibei Li; Jiangwei Xia; Yanjun Qin; Bing Ji; Zhongshi He

arXiv:2512.21999·cs.CV·December 29, 2025

Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs

Jiayu Hu, Beibei Li, Jiangwei Xia, Yanjun Qin, Bing Ji, Zhongshi He

PDF

Open Access 1 Video

TL;DR

This paper introduces an adversarial parametric editing framework called ALEAHallu to reduce hallucinations in vision-language models by fine-tuning critical parameters with adversarial prompts that emphasize visual evidence.

Contribution

The paper presents a novel Activate-Locate-Edit adversarial paradigm for targeted parameter fine-tuning to mitigate hallucinations in VLMs, surpassing heuristic calibration methods.

Findings

01

Significant reduction in hallucinations across tasks

02

Effective identification of hallucination-prone parameter clusters

03

Improved alignment with visual inputs

Abstract

While Vision-Language Models (VLMs) have garnered increasing attention in the AI community due to their promising practical applications, they exhibit persistent hallucination issues, generating outputs misaligned with visual inputs. Recent studies attribute these hallucinations to VLMs' over-reliance on linguistic priors and insufficient visual feature integration, proposing heuristic decoding calibration strategies to mitigate them. However, the non-trainable nature of these strategies inherently limits their optimization potential. To this end, we propose an adversarial parametric editing framework for Hallucination mitigation in VLMs, which follows an \textbf{A}ctivate-\textbf{L}ocate-\textbf{E}dit \textbf{A}dversarially paradigm. Specifically, we first construct an activation dataset that comprises grounded responses (positive samples attentively anchored in visual features) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Face Recognition and Perception · Ethics and Social Impacts of AI