Reinforced Collaboration in Multi-Agent Flow Networks

Zheng Wang; Yuang Liu; Yangkai Ding

arXiv:2605.12943·cs.LG·May 14, 2026

Reinforced Collaboration in Multi-Agent Flow Networks

Zheng Wang, Yuang Liu, Yangkai Ding

PDF

1 Repo

TL;DR

MANGO is a reinforcement learning framework that optimizes multi-agent collaboration in flow networks, significantly improving task performance and efficiency across multiple benchmarks.

Contribution

The paper introduces MANGO, a novel data-driven approach that refines agent workflows using reinforcement learning and textual gradients, with a skipping mechanism for efficiency.

Findings

01

Achieves up to 12.8% performance improvement over baselines.

02

Enhances efficiency by 47.4%.

03

Generalizes well to unseen domains.

Abstract

Multi-agent systems provide a powerful way to extend large language models (LLMs) by decomposing a complex task into specialized subtasks handled by different agents. However, their performance is often hindered by error propagation, arising from suboptimal workflow design or inaccurate agent outputs, which can propagate through the agent collaboration process and degrade final results. To address the challenges, we present MANGO (Multi-Agent Network Gradient Optimization), a data-driven framework that organizes and refines agent collaboration via a flow network constructed from past successful workflows. MANGO integrates reinforcement learning and textual gradients to jointly optimize workflow paths and agent behaviors, while a skipping mechanism prevents redundant updates to well-optimized agents for improving efficiency. Extensive experiments on seven benchmarks show that MANGO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

openJiuwen-ai/agent-store/tree/main/community/mango
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.