AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
Tianbo Wang, Yuqing Ma, Kewei Liao, Zhange Zhang, Simin Li, Jinyang Guo, Xianglong Liu

TL;DR
This paper introduces AFTER, a novel method for reducing object hallucinations in large vision-language models by adaptively guiding internal activations with factual textual semantics, significantly improving trustworthiness.
Contribution
The paper proposes a new adaptive editing framework, AFTER, that effectively incorporates factual guidance to mitigate hallucinations in LVLMs, addressing limitations of previous methods.
Findings
Achieves up to 16.3% reduction in hallucinations on AMBER benchmark.
Demonstrates effectiveness across three widely adopted LVLMs.
Validates the approach through extensive experiments on standard benchmarks.
Abstract
Large Vision-Language Models (LVLMs) have achieved substantial progress in cross-modal tasks. However, due to language bias, LVLMs are susceptible to object hallucination, which can be primarily divided into category, attribute, and relation hallucination, significantly impeding the trustworthy AI applications. Editing the internal activations of LVLMs has shown promising effectiveness in mitigating hallucinations with minimal cost. However, previous editing approaches neglect the effective guidance offered by factual textual semantics, thereby struggling to explicitly mitigate language bias. To address these issues, we propose Adaptive Factual-guided Visual-Textual Editing for hallucination mitigation (AFTER), which comprises Factual-Augmented Activation Steering (FAS) and Query-Adaptive Offset Optimization (QAO), to adaptively guides the original biased activations towards factual…
Peer Reviews
Decision·ICLR 2026 Poster
AFTER stands out by combining factual textual guidance with query-specific offsets to improve visual-textual activation editing. FAS uses factual annotations from images to provide clear guidance, effectively reducing language bias. Meanwhile, QAO adapts the editing process by creating query-specific offsets, allowing the model to handle distinct visual-textual associations more effectively. This innovation overcomes the limitation of previous methods that often use a single, averaged editing ve
- AFTER relies on accessing activations from open-source LLMs, which limits its applicability to closed-source models. This restricts the method’s use in private large-scale models or those that are not publicly accessible. Please explore post-processing steps based on model outputs during inference - QAO is presented as a key component to improve editing diversity and accuracy, the paper lacks in-depth analysis of its training process and its specific impact on performance, particularly in han
1. The figures and illustrations are excellent — the visuals are carefully designed and aesthetically pleasing, and they greatly help in clarifying certain issues I encountered during reading. 2. The writing is strong, and the appendix provides abundant experimental details, which convinces me that the reported results can be reproduced directly based on the information given in the paper. 3. The motivation is solid and well-founded. Similar concerns have been raised in other works, and the pape
1. The proposed method is evaluated primarily on relatively simple datasets, whose difficulty appears notably lower than the complexity illustrated in Figure 1. It remains unclear whether the approach would retain its effectiveness on more challenging benchmarks, such as HallusionBench[1] or CRPE[2]. 2. The approach relies heavily on datasets like COCO, which are richly annotated by humans, and thus depends on the availability of high-quality manual annotations. Given that the current results ar
- The trusted vs. untrusted pairing is a smart way to bootstrap supervision without manual labeling at scale. - Inference behaves like a lightweight adapter, so the method is efficient. - Results are strong across both discriminative and generative benchmarks. - Analysis is thorough, with useful ablations, including performance as the image pool grows from 50 to 500. - The out-of-distribution check is informative and suggests the method does not degrade under domain or QA-style shifts (discrimin
- The pipeline relies on ground-truth object annotations during preparation, which may be unavailable in some domains. In QAO, query-focused supervision depends on extracting objects from the question and checking membership against the COCO-derived category fact set T_c; if a queried object is not in T_c, they synthesize a negative textual sub-description. Without rich annotations or a compatible taxonomy, this step becomes brittle and hard to port. - Implementation detail: the method is model-
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis
