GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning

Jiaqi Wu; Qinlao Zhao; Zefeng Chen; Kai Qin; Yifei Zhao; Xueqian Wang; Yuhang Yao

arXiv:2510.25320 · cs.AI · October 30, 2025

TL;DR

GAP introduces a graph-based planning framework for autonomous agents that enables adaptive parallel and serial tool execution, significantly improving efficiency and accuracy in multi-step reasoning tasks involving large language models.

Contribution

The paper presents a novel graph-based planning approach that models task dependencies to optimize parallel and sequential tool use, enhancing multi-step reasoning performance.
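To make the idea of modeling task dependencies concrete, here is a minimal sketch of how a sub-task graph can be grouped into rounds whose members are mutually independent. It is an illustration only, not the paper's implementation: the function `parallel_levels`, the task names, and the example question are all hypothetical, and the grouping relies on Python's standard `graphlib.TopologicalSorter`.

```python
from graphlib import TopologicalSorter


def parallel_levels(deps):
    """Group sub-tasks into rounds; every task in a round has all of its
    dependencies satisfied by earlier rounds, so the round could run in parallel."""
    sorter = TopologicalSorter(deps)  # deps maps task -> set of prerequisite tasks
    sorter.prepare()                  # raises graphlib.CycleError on cyclic graphs
    levels = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only to make the output stable
        levels.append(ready)
        sorter.done(*ready)
    return levels


# Hypothetical multi-hop question: which of two directors was born earlier?
deps = {
    "search_director_a": set(),
    "search_director_b": set(),
    "compare_birth_years": {"search_director_a", "search_director_b"},
}
levels = parallel_levels(deps)
print(levels)
```

In this toy graph the two searches do not depend on each other, so they share a round, and the comparison waits for both. A strictly sequential agent would need three tool steps where this grouping needs two rounds.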

Findings

01. GAP outperforms ReAct in multi-hop question answering accuracy.

02. GAP achieves higher tool invocation efficiency through intelligent parallelization.

03. Experimental results show substantial improvements in task accuracy and efficiency.

Abstract

Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential reasoning and execution, failing to exploit the inherent parallelism among independent sub-tasks. This sequential bottleneck leads to inefficient tool utilization and suboptimal performance in multi-step reasoning scenarios. We introduce Graph-based Agent Planning (GAP), a novel framework that explicitly models inter-task dependencies through graph-based planning to enable adaptive parallel and serial tool execution. Our approach trains agent foundation models to decompose complex tasks into dependency-aware sub-task graphs, autonomously determining which tools can be executed in parallel and which must follow sequential dependencies. This dependency-aware orchestration achieves…
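The contrast between sequential execution and dependency-aware orchestration can be sketched with `asyncio`. This is a hedged illustration under stated assumptions, not the authors' training or inference code: `fake_tool` is a hypothetical stand-in for a real tool call, `run_graph` is an invented helper, and the sub-task graph is hand-written here, whereas in GAP the graph is produced by the trained model.

```python
import asyncio


async def fake_tool(name, inputs):
    # Hypothetical stand-in for a real tool call (search, calculator, ...).
    await asyncio.sleep(0.01)
    return f"{name}({', '.join(sorted(inputs))})"


async def run_graph(deps):
    """Execute a sub-task graph round by round.

    Tasks whose dependencies are all finished run concurrently in one round;
    tasks with unmet dependencies wait for a later round.
    """
    results = {}
    pending = dict(deps)
    rounds = 0
    while pending:
        ready = [t for t, d in pending.items() if all(dep in results for dep in d)]
        if not ready:
            raise ValueError("cycle or missing dependency in sub-task graph")
        # Independent tool calls in this round are issued together.
        outputs = await asyncio.gather(*(fake_tool(t, pending[t]) for t in ready))
        results.update(zip(ready, outputs))
        for t in ready:
            del pending[t]
        rounds += 1
    return results, rounds


# Assumed example graph: two independent lookups feeding one comparison.
graph = {
    "lookup_x": set(),
    "lookup_y": set(),
    "combine": {"lookup_x", "lookup_y"},
}
results, rounds = asyncio.run(run_graph(graph))
print(rounds, results["combine"])
```

Here the two lookups are awaited together in the first round and `combine` runs in the second, so three tool calls complete in two rounds instead of three sequential steps.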
