MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

Wei Ding; Yilin Li; Yudong Zhang; Ruobing Xie; Xingwu Sun; Jiansheng Chen; Yu Wang

arXiv:2605.14966·cs.CV·May 15, 2026

MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

Wei Ding, Yilin Li, Yudong Zhang, Ruobing Xie, Xingwu Sun, Jiansheng Chen, Yu Wang

PDF

TL;DR

This paper introduces MHSA, a lightweight framework that mitigates hallucinations in large vision-language models by correcting cross-modal attention patterns without altering the original model parameters.

Contribution

MHSA extends cross-modal attention from detection to mitigation of hallucinations, using a simple MLP to produce corrected attention during inference.

Findings

01

MHSA effectively reduces hallucinations across various datasets and LVLMs.

02

Replacing original attention with corrected attention improves model reliability.

03

The framework does not require modifying existing LVLM parameters.

Abstract

Large vision-language models (LVLMs) have achieved remarkable performance across diverse multimodal tasks, yet they continue to suffer from hallucinations, generating content that is inconsistent with the visual input. Prior work DHCP (Detecting Hallucinations by Cross-modal Attention Pattern) has explored hallucination detection from the perspective of cross-modal attention, but does not address hallucination mitigation. In this paper, we propose MHSA (Mitigating Hallucinations via Steered Attention), a lightweight framework that mitigates hallucinations by learning to correct cross-modal attention patterns in LVLMs. MHSA trains a simple three-layer MLP generator to produce corrected attention, guided by supervisory signals from the DHCP discriminator and the LVLM itself. During inference, MHSA mitigates both discriminative and generative hallucinations across various datasets and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.