Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty

Zehui Ling; Deshu Chen; Hongwei Zhang; Yifeng Jiao; Xin Guo; Yuan Cheng

arXiv:2506.10446·cs.CL·June 13, 2025

Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty

Zehui Ling, Deshu Chen, Hongwei Zhang, Yifeng Jiao, Xin Guo, Yuan Cheng

PDF

Open Access

TL;DR

This paper introduces a novel length penalty method to improve reasoning efficiency in large language models, reducing output length for simple tasks while maintaining or improving accuracy on complex benchmarks.

Contribution

It proposes a new reward function with a length penalty that adapts to problem complexity, enhancing reasoning efficiency and performance across diverse datasets.

Findings

01

Shortened outputs on GSM8K and MATH500 with maintained or improved accuracy.

02

Achieved higher accuracy on the challenging AIME2024 dataset.

03

Demonstrated effectiveness of the method across three benchmark datasets.

Abstract

Large language models (LLMs) have demonstrated significant advancements in reasoning capabilities, performing well on various challenging benchmarks. Techniques like Chain-of-Thought prompting have been introduced to further improve reasoning. However, these approaches frequently generate longer outputs, which in turn increase computational latency. Although some methods use reinforcement learning to shorten reasoning, they often apply uniform penalties without considering the problem's complexity, leading to suboptimal outcomes. In this study, we seek to enhance the efficiency of LLM reasoning by promoting conciseness for simpler problems while preserving sufficient reasoning for more complex ones for accuracy, thus improving the model's overall performance. Specifically, we manage the model's reasoning efficiency by dividing the reward function and including a novel penalty for output…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)