NaviAgent: Graph-Driven Bilevel Planning for Scalable Tool Orchestration

Yan Jiang; Hao Zhou; Lizhong GU; Tianlong Li; Ruinan Jin; Wanqi Zhou; Ai Han

arXiv:2506.19500·cs.AI·May 22, 2026

NaviAgent: Graph-Driven Bilevel Planning for Scalable Tool Orchestration

Yan Jiang, Hao Zhou, Lizhong GU, Tianlong Li, Ruinan Jin, Wanqi Zhou, Ai Han

PDF

TL;DR

NaviAgent introduces a graph-based bilevel architecture for scalable, robust tool invocation in LLMs, improving task success and efficiency in large tool ecosystems.

Contribution

It proposes a novel graph-driven bilevel framework with a Tool World Navigation Model for improved scalability and robustness in tool orchestration.

Findings

01

Achieves a 13.1 point increase in task success rate on complex tasks.

02

Demonstrates consistent 4.3-12.0 point gains across real APIs and domains.

03

Reduces steps and latency in tool invocation sequences.

Abstract

Large Language Models (LLMs) increasingly act as function-call agents that invoke external tools to tackle tasks beyond their static knowledge. However, they typically invoke tools one at a time without a global view of task structure. As tools often depend on one another, this leads to error accumulation and poor scalability, particularly when scaling to hundreds or thousands of tools. To address these limitations, we propose NaviAgent, an explicit bilevel architecture that decouples task planning from tool execution through graph-based modeling of tool relations. At the planning level, the LLM-based agent decides whether to respond directly, clarify intent, or retrieve and execute a toolchain independent of inter-tool complexity. At the execution level, a Tool World Navigation Model (TWNM) encodes structural and behavioral relations among tools, steering the agent to compose scalable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.