Symphony-Coord: Emergent Coordination in Decentralized Agent Systems
Zhaoyang Guan, Huixi Cao, Ming Zhong, Eric Yang, Lynn Ai, Yongxin Ni, Bill Shi

TL;DR
Symphony-Coord introduces a decentralized, adaptive coordination framework for multi-agent systems that organically emerges roles through online learning, improving efficiency and robustness without relying on static roles or centralized control.
Contribution
It transforms agent coordination into an online multi-armed bandit problem with a novel two-stage beacon protocol, enabling scalable, adaptive, and fault-tolerant multi-agent task routing.
Findings
Enhances task routing efficiency in multi-agent systems.
Demonstrates robustness to agent failures and distribution shifts.
Achieves near-optimal allocation with sublinear regret bounds.
Abstract
Multi-agent large language model systems can tackle complex multi-step tasks by decomposing work and coordinating specialized behaviors. However, current coordination mechanisms typically rely on statically assigned roles and centralized controllers. As agent pools and task distributions evolve, these design choices lead to inefficient routing, poor adaptability, and fragile fault recovery capabilities. We introduce Symphony-Coord, a decentralized multi-agent framework that transforms agent selection into an online multi-armed bandit problem, enabling roles to emerge organically through interaction. The framework employs a two-stage dynamic beacon protocol: (i) a lightweight candidate screening mechanism to limit communication and computational overhead; (ii) an adaptive LinUCB selector that routes subtasks based on context features derived from task requirements and agent states,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics
