Limited Reasoning Space: The cage of long-horizon reasoning in LLMs
Zhenyu Li, Guanlin Wu, Cheems Wang, Yongqiang Zhao

TL;DR
This paper investigates the limitations of large language models in long-horizon reasoning, revealing that excessive planning can impair performance, and introduces Halo, a control framework that optimizes reasoning boundaries for better task-solving.
Contribution
The paper introduces Halo, a novel model predictive control framework that dynamically manages reasoning scope in LLMs, addressing over-planning issues in long-horizon tasks.
Findings
Halo outperforms static planning baselines on complex tasks.
Over-planning can lead to performance collapse in LLM reasoning.
Dynamic regulation of reasoning boundaries improves long-horizon reasoning.
Abstract
The test-time compute strategy, such as Chain-of-Thought (CoT), has significantly enhanced the ability of large language models to solve complex tasks like logical reasoning. However, empirical studies indicate that simply increasing the compute budget can sometimes lead to a collapse in test-time performance when employing typical task decomposition strategies such as CoT. This work hypothesizes that reasoning failures with larger compute budgets stem from static planning methods, which hardly perceive the intrinsic boundaries of LLM reasoning. We term it as the Limited Reasoning Space hypothesis and perform theoretical analysis through the lens of a non-autonomous stochastic dynamical system. This insight suggests that there is an optimal range for compute budgets; over-planning can lead to redundant feedback and may even impair reasoning capabilities. To exploit the compute-scaling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Constraint Satisfaction and Optimization
