Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models

Yanwen Huang; Yong Zhang; Ning Cheng; Zhitao Li; Shaojun Wang; Jing Xiao

arXiv:2501.01059·cs.CL·February 26, 2025

TL;DR

This paper introduces DAGCD, a lightweight single-pass decoding framework that uses attention distributions and token-level uncertainty signals to reduce context faithfulness hallucinations in large language models, improving faithfulness and robustness while preserving computational efficiency.

Contribution

The paper proposes DAGCD, an attention-guided decoding method that mitigates context faithfulness hallucinations in LLMs by leveraging attention distributions and uncertainty signals in a single decoding pass.

Findings

01. DAGCD significantly improves faithfulness in open-book QA tasks.

02. DAGCD enhances model robustness against hallucinations.

03. The method maintains computational efficiency during decoding.

Abstract

Large language models (LLMs) often exhibit Context Faithfulness Hallucinations, where outputs deviate from retrieved information due to incomplete context integration. Our analysis reveals a strong correlation between token-level uncertainty and hallucinations. We hypothesize that attention mechanisms inherently encode context utilization signals, a hypothesis supported by probing analysis. Based on these insights, we propose Dynamic Attention-Guided Context Decoding (DAGCD), a lightweight framework that leverages attention distributions and uncertainty signals in a single decoding pass. Experiments on open-book QA datasets demonstrate DAGCD's effectiveness, yielding significant improvements in faithfulness and robustness while preserving computational efficiency.
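
To make the idea in the abstract concrete, the following is a minimal sketch of one way attention and uncertainty signals could be combined at a single decoding step. It is not the authors' implementation. The function names, the additive adjustment rule, the use of normalized entropy as the uncertainty signal, and the `alpha` scaling parameter are all assumptions introduced for illustration; the abstract does not specify these details. The sketch assumes the next-token logits, the vocabulary ids of the context tokens, and one attention weight per context token (already aggregated over heads and layers) are available from the same forward pass.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Shannon entropy divided by log(vocab size), so the result lies in [0, 1]."""
    if len(probs) < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))


def attention_guided_logits(logits, context_token_ids, attention_weights, alpha=2.0):
    """Boost the logits of vocabulary ids that receive attention in the context.

    Hypothetical rule (an assumption, not taken from the paper):
        adjusted[v] = logits[v] + alpha * uncertainty * score[v]
    where score[v] is the share of context attention falling on token id v
    and uncertainty is the normalized entropy of the original distribution.
    """
    if len(context_token_ids) != len(attention_weights):
        raise ValueError("need exactly one attention weight per context token")

    # Uncertainty signal: high when the model is unsure about the next token.
    uncertainty = normalized_entropy(softmax(logits))

    # Aggregate attention per vocabulary id and normalize to sum to 1.
    scores = [0.0] * len(logits)
    total = sum(attention_weights)
    if total > 0.0:
        for token_id, weight in zip(context_token_ids, attention_weights):
            scores[token_id] += weight / total

    # The adjustment shrinks toward zero when the model is already confident.
    return [l + alpha * uncertainty * s for l, s in zip(logits, scores)]


def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)


# Toy vocabulary of 4 ids; the context holds ids 1, 3, 1 with most attention on id 1.
uncertain = attention_guided_logits([2.0, 1.9, 0.1, 0.0], [1, 3, 1], [0.5, 0.1, 0.4])
confident = attention_guided_logits([10.0, 0.0, 0.0, 0.0], [1, 3, 1], [0.5, 0.1, 0.4])
```

In this toy setup the uncertain step switches its top choice from id 0 to the attended context id 1, while the confident step is left almost untouched. Everything runs on quantities from one forward pass, which is the property the abstract describes as single-pass decoding.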

Taxonomy

Topics: Mental Health via Writing · Machine Learning in Healthcare · Digital Mental Health Interventions

Methods: Softmax · Attention Is All You Need