Think Clearly: Improving Reasoning via Redundant Token Pruning

Daewon Choi; Jimin Lee; Jihoon Tack; Woomin Song; Saket Dingliwal; Sai Muralidhar Jayanthi; Bhavana Ganesh; Jinwoo Shin; Aram Galstyan; Sravan Babu Bodapati

arXiv:2507.08806·cs.AI·July 15, 2025

Think Clearly: Improving Reasoning via Redundant Token Pruning

Daewon Choi, Jimin Lee, Jihoon Tack, Woomin Song, Saket Dingliwal, Sai Muralidhar Jayanthi, Bhavana Ganesh, Jinwoo Shin, Aram Galstyan, Sravan Babu Bodapati

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to improve reasoning in large language models by identifying and removing redundant tokens based on attention scores, leading to better accuracy without additional training.

Contribution

The paper proposes a structure-aware token pruning technique that enhances reasoning performance by eliminating redundancy in the reasoning process of language models.

Findings

01

Significant accuracy improvements on reasoning benchmarks

02

Effective reduction of reasoning redundancy through token pruning

03

Enhanced performance on mathematical competition datasets

Abstract

Recent large language models have shown promising capabilities in long-form reasoning, following structured chains of thought before arriving at a final answer. However, we observe that these reasoning paths tend to include substantial redundancy; analyzing attention patterns reveals that attention scores are widely scattered, particularly incorrect answers exhibit greater attention sparsity. In this paper, we demonstrate that deliberately removing this redundancy in the reasoning process significantly improves performance through clear thinking, i.e., removing distraction. Specifically, we systematically identify reasoning redundancy by measuring token-level attention scores to a special end-of-thinking token, which is appended to an explicit instruction inserted to conclude each intermediate reasoning step. Furthermore, we propose structure-aware pruning that prioritizes removing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Think Clearly: Improving Reasoning via Redundant Token Pruning· underline

Taxonomy

TopicsNatural Language Processing Techniques · Logic, programming, and type systems · Logic, Reasoning, and Knowledge

MethodsPruning