Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Yixing Chen; Yiding Wang; Siqi Zhu; Haofei Yu; Tao Feng; Muhan Zhang; Mostofa Patwary; Jiaxuan You

arXiv:2510.23595·cs.AI·October 31, 2025

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Yixing Chen, Yiding Wang, Siqi Zhu, Haofei Yu, Tao Feng, Muhan Zhang, Mostofa Patwary, Jiaxuan You

PDF

3 Models

TL;DR

Multi-Agent Evolve (MAE) is a scalable framework where LLMs self-improve their reasoning skills through co-evolving agents that generate, solve, and evaluate tasks without heavy reliance on human data.

Contribution

The paper introduces MAE, a novel multi-agent co-evolution framework enabling LLMs to self-improve across diverse reasoning tasks without extensive human supervision.

Findings

01

Achieves 4.54% average improvement on multiple benchmarks.

02

Demonstrates scalability and data efficiency in LLM self-improvement.

03

Operates effectively across mathematics, reasoning, and general knowledge tasks.

Abstract

Reinforcement Learning (RL) has demonstrated significant potential in enhancing the reasoning capabilities of large language models (LLMs). However, the success of RL for LLMs heavily relies on human-curated datasets and verifiable rewards, which limit their scalability and generality. Recent Self-Play RL methods, inspired by the success of the paradigm in games and Go, aim to enhance LLM reasoning capabilities without human-annotated data. However, their methods primarily depend on a grounded environment for feedback (e.g., a Python interpreter or a game engine); extending them to general domains remains challenging. To address these challenges, we propose Multi-Agent Evolve (MAE), a framework that enables LLMs to self-evolve in solving diverse tasks, including mathematics, reasoning, and general knowledge Q&A. The core design of MAE is based on a triplet of interacting agents…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.