Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Guanting Dong; Junting Lu; Junjie Huang; Wanjun Zhong; Longxiang Liu; Shijue Huang; Zhenyu Li; Yang Zhao; Xiaoshuai Song; Xiaoxi Li; Jiajie Jin; Yutao Zhu; Hanbin Wang; Fangyu Lei; Qinyu Luo; Mingyang Chen; Zehui Chen; Jiazhan Feng; Ji-Rong Wen; Zhicheng Dou

arXiv:2604.18292·cs.AI·April 21, 2026

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Guanting Dong, Junting Lu, Junjie Huang, Wanjun Zhong, Longxiang Liu, Shijue Huang, Zhenyu Li, Yang Zhao, Xiaoshuai Song, Xiaoxi Li, Jiajie Jin, Yutao Zhu, Hanbin Wang, Fangyu Lei, Qinyu Luo, Mingyang Chen, Zehui Chen, Jiazhan Feng, Ji-Rong Wen, Zhicheng Dou

PDF

TL;DR

Agent-World introduces a scalable, self-evolving environment platform that autonomously generates tasks and environments to enhance general agent training and capabilities.

Contribution

It presents a novel self-evolving training arena combining environment discovery and continuous reinforcement learning for advancing general agent intelligence.

Findings

01

Agent-World outperforms proprietary models on 23 benchmarks.

02

Scaling trends relate to environment diversity and self-evolution rounds.

03

The platform enables co-evolution of agent policies and environments.

Abstract

Large language models are increasingly expected to serve as general-purpose agents that interact with external, stateful tool environments. The Model Context Protocol (MCP) and broader agent skills offer a unified interface for connecting agents with scalable real-world services, but training robust agents remains limited by the lack of realistic environments and principled mechanisms for life-long learning. In this paper, we present \textbf{Agent-World}, a self-evolving training arena for advancing general agent intelligence through scalable environments. Agent-World has two main components: (1) Agentic Environment-Task Discovery, which autonomously explores topic-aligned databases and executable tool ecosystems from thousands of real-world environment themes and synthesizes verifiable tasks with controllable difficulty; and (2) Continuous Self-Evolving Agent Training, which combines…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.