Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

Qiguang Chen; Dengyun Peng; Jinhao Liu; HuiKang Su; Jiannan Guan; Libo Qin; Wanxiang Che

arXiv:2508.11582·cs.CL·August 18, 2025

Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in Large Language Models

Qiguang Chen, Dengyun Peng, Jinhao Liu, HuiKang Su, Jiannan Guan, Libo Qin, Wanxiang Che

PDF

TL;DR

This paper introduces DR. SAF, a framework enabling large language models to dynamically adjust reasoning depth, significantly improving efficiency and reducing computational costs while maintaining accuracy.

Contribution

The paper presents a novel self-awareness framework that allows LLMs to adapt their reasoning process based on problem complexity, enhancing efficiency without sacrificing performance.

Findings

01

49.27% reduction in total response tokens

02

6.59x increase in token efficiency

03

5x decrease in training time

Abstract

Recent advancements in large language models (LLMs) have greatly improved their capabilities on complex reasoning tasks through Long Chain-of-Thought (CoT). However, this approach often results in substantial redundancy, impairing computational efficiency and causing significant delays in real-time applications. To improve the efficiency, current methods often rely on human-defined difficulty priors, which do not align with the LLM's self-awared difficulty, leading to inefficiencies. In this paper, we introduce the Dynamic Reasoning-Boundary Self-Awareness Framework (DR. SAF), which enables models to dynamically assess and adjust their reasoning depth in response to problem complexity. DR. SAF integrates three key components: Boundary Self-Awareness Alignment, Adaptive Reward Management, and a Boundary Preservation Mechanism. These components allow models to optimize their reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.