Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Yiju Guo; Wenkai Yang; Zexu Sun; Ning Ding; Zhiyuan Liu; Yankai Lin

arXiv:2506.07851·cs.CL·October 27, 2025

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin

PDF

Open Access

TL;DR

This paper introduces LeaF, a two-stage framework that improves large language models' focus on critical tokens during reasoning by identifying and pruning confounding tokens, leading to better accuracy and interpretability.

Contribution

LeaF is a novel intervention-based distillation method that automatically identifies and prunes confounding tokens to enhance model focus and reasoning accuracy.

Findings

01

Improves reasoning accuracy on multiple benchmarks

02

Reduces attention to confounding tokens during inference

03

Enhances interpretability and reliability of LLMs

Abstract

Large language models (LLMs) have demonstrated significant improvements in contextual understanding. However, their ability to attend to truly critical information during long-context reasoning and generation still falls behind the pace. Specifically, our preliminary experiments reveal that certain distracting patterns can misdirect the model's attention during inference, and removing these patterns substantially improves reasoning accuracy and generation quality. We attribute this phenomenon to spurious correlations in the training data, which obstruct the model's capacity to infer authentic causal instruction-response relationships. This phenomenon may induce redundant reasoning processes, potentially resulting in significant inference overhead and, more critically, the generation of erroneous or suboptimal responses. To mitigate this, we introduce a two-stage framework called…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Multimodal Machine Learning Applications