AIR: Post-training Data Selection for Reasoning via Attention Head Influence

Jinrui Liu; Jeff Wu; Xuanguang Pan; Gavin Cheung; Shuai Ma; Chongyang Tao

arXiv:2512.13279·cs.CL·December 16, 2025

AIR: Post-training Data Selection for Reasoning via Attention Head Influence

Jinrui Liu, Jeff Wu, Xuanguang Pan, Gavin Cheung, Shuai Ma, Chongyang Tao

PDF

Open Access

TL;DR

This paper introduces AIR, a novel, unsupervised data selection method that uses attention head influence to improve reasoning capabilities in large language models through targeted post-training data selection.

Contribution

AIR provides a mechanistic, training-free framework that identifies critical reasoning steps and samples, enhancing reasoning distillation in LLMs beyond heuristic approaches.

Findings

01

AIR outperforms heuristic baselines in reasoning accuracy

02

It effectively isolates critical reasoning steps and samples

03

The method improves data efficiency in reasoning distillation

Abstract

LLMs achieve remarkable multi-step reasoning capabilities, yet effectively transferring these skills via post-training distillation remains challenging. Existing data selection methods, ranging from manual curation to heuristics based on length, entropy, or overall loss, fail to capture the causal importance of individual reasoning steps, limiting distillation efficiency. To address this, we propose Attention Influence for Reasoning (AIR), a principled, unsupervised and training-free framework that leverages mechanistic insights of the retrieval head to select high-value post-training data. AIR first identifies reasoning-critical attention heads of an off-the-shelf model, then constructs a weakened reference model with disabled head influence, and finally quantifies the resulting loss divergence as the Attention Influence Score. This score enables fine-grained assessment at both the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)