TL;DR
MANGO is a reinforcement learning framework that optimizes multi-agent collaboration in flow networks, significantly improving task performance and efficiency across multiple benchmarks.
Contribution
The paper introduces MANGO, a novel data-driven approach that refines agent workflows using reinforcement learning and textual gradients, with a skipping mechanism for efficiency.
Findings
Achieves up to 12.8% performance improvement over baselines.
Enhances efficiency by 47.4%.
Generalizes well to unseen domains.
Abstract
Multi-agent systems provide a powerful way to extend large language models (LLMs) by decomposing a complex task into specialized subtasks handled by different agents. However, their performance is often hindered by error propagation, arising from suboptimal workflow design or inaccurate agent outputs, which can propagate through the agent collaboration process and degrade final results. To address the challenges, we present MANGO (Multi-Agent Network Gradient Optimization), a data-driven framework that organizes and refines agent collaboration via a flow network constructed from past successful workflows. MANGO integrates reinforcement learning and textual gradients to jointly optimize workflow paths and agent behaviors, while a skipping mechanism prevents redundant updates to well-optimized agents for improving efficiency. Extensive experiments on seven benchmarks show that MANGO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
