Complete Cyclic Subtask Graphs for Tool-Using LLM Agents: Flexibility, Cost, and Bottlenecks in Multi-Agent Workflows
Luay Gharzeddine, Samer Saab Jr

TL;DR
This paper investigates complete cyclic subtask graphs as a flexible multi-agent architecture for tool-using LLMs, analyzing their benefits and costs across different domains and workflows.
Contribution
It introduces a formal framework for cyclic subtask graphs and evaluates their effectiveness and efficiency in various multi-agent task settings.
Findings
Revisitation supports recovery and exploration in ALFWorld.
TextCraft favors simpler forward execution over revisitation.
Finance-Agent is bottlenecked by retrieval and evidence synthesis.
Abstract
Long-horizon tool-using tasks sometimes benefit from revisiting earlier subtasks for recovery and exploration, but added multi-agent workflow flexibility can also introduce coordination overhead and substantial inference cost. We study complete cyclic subtask graphs, a deliberately maximally flexible multi-agent architecture in which executable subtask nodes are fully connected and a unified state-analysis-and-routing agent selects transitions using natural-language criteria. This makes unrestricted revisitation explicit and directly analyzable at the subtask level. We evaluate task-specific (Spec-Cyc) and benchmark-generic (Gen-Cyc) graphs on TextCraft, ALFWorld, and Finance-Agent, with ablations over planner/executor/router strength, tool exposure (generalist vs specialized), -shot successful trajectory summaries, and fault-injected random subtask perturbations. The benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
