Multi-Agent Collaboration via Evolving Orchestration

Yufan Dang; Chen Qian; Xueheng Luo; Jingru Fan; Zihao Xie; Ruijie Shi; Weize Chen; Cheng Yang; Xiaoyin Che; Ye Tian; Xuantang Xiong; Lei Han; Zhiyuan Liu; Maosong Sun

arXiv:2505.19591·cs.CL·October 22, 2025

Multi-Agent Collaboration via Evolving Orchestration

Yufan Dang, Chen Qian, Xueheng Luo, Jingru Fan, Zihao Xie, Ruijie Shi, Weize Chen, Cheng Yang, Xiaoyin Che, Ye Tian, Xuantang Xiong, Lei Han, Zhiyuan Liu, Maosong Sun

PDF

Open Access 1 Video

TL;DR

This paper introduces a dynamic, reinforcement learning-based orchestrator for multi-agent LLM collaboration, enabling flexible, efficient collective reasoning that adapts to task complexity and reduces computational costs.

Contribution

It proposes a novel puppeteer-style paradigm with an adaptive orchestrator trained via reinforcement learning for improved multi-agent LLM collaboration.

Findings

01

Achieves superior performance over static structures

02

Reduces computational costs in multi-agent tasks

03

Emergence of cyclic reasoning structures enhances reasoning quality

Abstract

Large language models (LLMs) have achieved remarkable results across diverse downstream tasks, but their monolithic nature restricts scalability and efficiency in complex problem-solving. While recent research explores multi-agent collaboration among LLMs, most approaches rely on static organizational structures that struggle to adapt as task complexity and agent numbers grow, resulting in coordination overhead and inefficiencies. To this end, we propose a puppeteer-style paradigm for LLM-based multi-agent collaboration, where a centralized orchestrator ("puppeteer") dynamically directs agents ("puppets") in response to evolving task states. This orchestrator is trained via reinforcement learning to adaptively sequence and prioritize agents, enabling flexible and evolvable collective reasoning. Experiments on closed- and open-domain scenarios show that this method achieves superior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Multi-Agent Collaboration via Evolving Orchestration· slideslive

Taxonomy

TopicsCollaboration in agile enterprises · Multi-Agent Systems and Negotiation