SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly
Wei Zhu, Zhiwen Tang, Kun Yue

TL;DR
SYMPHONY introduces a multi-agent framework using diverse language models to improve exploration and performance in complex planning tasks, outperforming existing methods with open-source and cloud-based models.
Contribution
The paper presents a novel multi-agent planning approach that combines heterogeneous language models to enhance exploration and planning efficiency.
Findings
SYMPHONY outperforms state-of-the-art baselines on benchmark tasks.
Heterogeneous multi-agent coordination improves planning diversity.
Open-source LLMs are sufficient for strong performance, further enhanced by cloud-based models.
Abstract
Recent advancements have increasingly focused on leveraging large language models (LLMs) to construct autonomous agents for complex problem-solving tasks. However, existing approaches predominantly employ a single-agent framework to generate search branches and estimate rewards during Monte Carlo Tree Search (MCTS) planning. This single-agent paradigm inherently limits exploration capabilities, often resulting in insufficient diversity among generated branches and suboptimal planning performance. To overcome these limitations, we propose Synergistic Multi-agent Planning with Heterogeneous langauge model assembly (SYMPHONY), a novel multi-agent planning framework that integrates a pool of heterogeneous language model-based agents. By leveraging diverse reasoning patterns across agents, SYMPHONY enhances rollout diversity and facilitates more effective exploration. Empirical results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics
