Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning

Jiaxi Bi; Tongxu Luo; Wenyu Du; Zhengyang Tang; Benyou Wang

arXiv:2604.16029·cs.CL·April 20, 2026

TL;DR

This paper introduces STOP, a learnable internal path pruning method for large reasoning models, significantly improving efficiency and accuracy in parallel reasoning tasks.

Contribution

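The paper provides the first systematic taxonomy of path pruning methods, organized by signal source (internal vs. external) and learnability (learnable vs. non-learnable), and proposes STOP, a learnable internal pruning technique validated on LRMs from 1.5B to 20B parameters.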

Findings

01 STOP outperforms existing baselines in effectiveness and efficiency.

02 Scalability of STOP is validated across models from 1.5B to 20B parameters.

03 STOP improves GPT-OSS-20B accuracy on AIME25 from 84% to nearly 90%.

Abstract

Parallel reasoning enhances Large Reasoning Models (LRMs) but incurs prohibitive costs due to futile paths caused by early errors. To mitigate this, path pruning at the prefix level is essential, yet existing research remains fragmented without a standardized framework. In this work, we propose the first systematic taxonomy of path pruning, categorizing methods by their signal source (internal vs. external) and learnability (learnable vs. non-learnable). This classification reveals the unexplored potential of learnable internal methods, motivating our proposal of STOP (Super TOken for Pruning). Extensive evaluations across LRMs ranging from 1.5B to 20B parameters demonstrate that STOP achieves superior effectiveness and efficiency compared to existing baselines. Furthermore, we rigorously validate the scalability of STOP under varying compute budgets - for instance, boosting GPT-OSS-20B…
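
The general idea of prefix-level path pruning described in the abstract can be illustrated with a minimal sketch: sample several reasoning prefixes, score each one, and continue decoding only the most promising. This is not the STOP method itself, whose details are not given in the text above. The `Path` class, `prune_paths`, `toy_scorer`, and the `keep_ratio` parameter are all hypothetical names chosen for illustration, and the string-matching scorer is a stand-in for whatever internal or external signal a real pruner would use.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Path:
    prefix: str          # partial reasoning generated so far
    score: float = 0.0   # estimated promise of the prefix (higher is better)


def prune_paths(
    paths: List[Path],
    scorer: Callable[[str], float],
    keep_ratio: float,
) -> List[Path]:
    """Score every prefix and keep only the top fraction for full decoding."""
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    scored = [Path(p.prefix, scorer(p.prefix)) for p in paths]
    n_keep = max(1, int(len(scored) * keep_ratio))
    # sorted() is stable, so tied prefixes keep their original sampling order.
    return sorted(scored, key=lambda p: p.score, reverse=True)[:n_keep]


def toy_scorer(prefix: str) -> float:
    # Placeholder assumption: flag prefixes that contain an obvious early error.
    # A real scorer would be a learned or heuristic signal, not string matching.
    return 0.0 if "2 + 2 = 5" in prefix else 1.0


candidates = [
    Path("Let x = 3. Then 2 + 2 = 5, so"),
    Path("Let x = 3. Then 2 + 2 = 4, so"),
    Path("Assume n is even. Then n = 2k, so"),
    Path("We know 2 + 2 = 5, hence"),
]

# Keep half of the sampled prefixes; only these would be decoded to completion.
survivors = prune_paths(candidates, toy_scorer, keep_ratio=0.5)
for path in survivors:
    print(path.score, path.prefix)
```

In this sketch the compute saving comes from never finishing the discarded paths: with `keep_ratio=0.5`, half of the candidates stop after the prefix. In the taxonomy's terms, swapping `toy_scorer` for a signal read from the model's own states would make the pruner internal, while a separate scoring model would make it external.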
