ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization

Zhensheng Jin; Xinze Li; Yifan Ji; Chunyi Peng; Zhenghao Liu; Qi Shi; Yukun Yan; Shuo Wang; Furong Peng; Ge Yu

arXiv:2506.10822·cs.CL·June 13, 2025

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization

Zhensheng Jin, Xinze Li, Yifan Ji, Chunyi Peng, Zhenghao Liu, Qi Shi, Yukun Yan, Shuo Wang, Furong Peng, Ge Yu

PDF

Open Access 1 Repo 1 Video

TL;DR

ReCUT is a novel method that balances reasoning length and accuracy in large language models by exploring diverse reasoning paths and training specialized models, resulting in shorter, accurate reasoning traces.

Contribution

ReCUT introduces stepwise exploration and preference-based training to effectively reduce reasoning length without sacrificing accuracy in LLMs.

Findings

01

Reduces reasoning length by 30-50%

02

Maintains or improves reasoning accuracy

03

Demonstrates effectiveness across multiple datasets

Abstract

Recent advances in Chain-of-Thought (CoT) prompting have substantially improved the reasoning capabilities of Large Language Models (LLMs). However, these methods often suffer from overthinking, leading to unnecessarily lengthy or redundant reasoning traces. Existing approaches attempt to mitigate this issue through curating multiple reasoning chains for training LLMs, but their effectiveness is often constrained by the quality of the generated data and prone to overfitting. To address the challenge, we propose Reasoning Compression ThroUgh Stepwise Trials (ReCUT), a novel method aimed at balancing the accuracy and length of reasoning trajectory. Specifically, ReCUT employs a stepwise exploration mechanism and a long-short switched sampling strategy, enabling LLMs to incrementally generate diverse reasoning paths. These paths are evaluated and used to construct preference pairs to train…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neuir/recut
noneOfficial

Videos

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications