Revealing the Barriers of Language Agents in Planning
Jian Xie, Kexun Zhang, Jiangjie Chen, Siyu Yuan, Kai Zhang, Yikai Zhang, Lei Li, Yanghua Xiao

TL;DR
This paper investigates why current language agents struggle with complex planning tasks, identifying key limitations in their reasoning capabilities and evaluating strategies to overcome these challenges.
Contribution
It applies feature attribution analysis to uncover fundamental barriers in language agent planning, highlighting the limited impact of constraints and questions on their reasoning.
Findings
Even the state-of-the-art reasoning model, OpenAI o1, achieves only 15.6% on one complex real-world planning benchmark.
Constraints and questions have diminishing influence on agent planning.
Existing strategies partially mitigate but do not fully resolve planning limitations.
Abstract
Autonomous planning has been an ongoing pursuit since the inception of artificial intelligence. Based on curated problem solvers, early planning agents could deliver precise solutions for specific tasks but lacked generalization. The emergence of large language models (LLMs) and their powerful reasoning capabilities has reignited interest in autonomous planning by automatically generating reasonable solutions for given tasks. However, prior research and our experiments show that current language agents still lack human-level planning abilities. Even the state-of-the-art reasoning model, OpenAI o1, achieves only 15.6% on one of the complex real-world planning benchmarks. This highlights a critical question: What hinders language agents from achieving human-level planning? Although existing studies have highlighted weak performance in agent planning, the deeper underlying issues and the…
Taxonomy
Topics: Natural Language Processing Techniques · Linguistics and Terminology Studies · Multi-Agent Systems and Negotiation
