GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning
Jiaqi Wu, Qinlao Zhao, Zefeng Chen, Kai Qin, Yifei Zhao, Xueqian Wang, Yuhang Yao

TL;DR
GAP introduces a graph-based planning framework for autonomous agents that enables adaptive parallel and serial tool execution, significantly improving efficiency and accuracy in multi-step reasoning tasks involving large language models.
Contribution
The paper presents a novel graph-based planning approach that models task dependencies to optimize parallel and sequential tool use, enhancing multi-step reasoning performance.
Findings
GAP outperforms ReAct in multi-hop question answering accuracy.
GAP achieves higher tool invocation efficiency through intelligent parallelization.
Experimental results show substantial improvements in task accuracy and efficiency.
Abstract
Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential reasoning and execution, failing to exploit the inherent parallelism among independent sub-tasks. This sequential bottleneck leads to inefficient tool utilization and suboptimal performance in multi-step reasoning scenarios. We introduce Graph-based Agent Planning (GAP), a novel framework that explicitly models inter-task dependencies through graph-based planning to enable adaptive parallel and serial tool execution. Our approach trains agent foundation models to decompose complex tasks into dependency-aware sub-task graphs, autonomously determining which tools can be executed in parallel and which must follow sequential dependencies. This dependency-aware orchestration achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
