AgentBalance: Backbone-then-Topology Design for Cost-Effective Multi-Agent Systems under Budget Constraints
Shuowei Cai, Yansong Ning, Hao Liu

TL;DR
AgentBalance introduces a backbone-then-topology framework for cost-effective multi-agent systems that optimizes performance under explicit token-cost and latency budgets, improving deployment efficiency.
Contribution
It proposes a novel backbone-then-topology design approach that models and optimizes multi-agent systems under explicit deployment constraints, outperforming existing methods.
Findings
Achieves up to 10% performance gains under token-cost budgets.
Achieves up to 22% performance gains under latency budgets.
Generalizes well to unseen LLMs for practical deployment.
Abstract
Large Language Model (LLM)-based multi-agent systems (MAS) are becoming indispensable building blocks for web-scale applications such as web search, social network analytics, and online customer support, where cost-effectiveness is increasingly the primary constraint for large-scale deployment. While recent work improves MAS cost-effectiveness by shaping inter-agent communication topologies and selecting agent backbones, it rarely models and optimizes under explicit token-cost and latency budgets that reflect deployment constraints. This often leads to topology-first designs and suboptimal cost-effectiveness when budgets are binding. We present AgentBalance, a framework for constructing cost-effective MAS under explicit token-cost and latency budgets via a backbone-then-topology design. AgentBalance first performs backbone-oriented agent generation, constructing agents with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Multimodal Machine Learning Applications · IoT and Edge/Fog Computing
