AgentGym: Evolving Large Language Model-based Agents across Diverse   Environments

Zhiheng Xi; Yiwen Ding; Wenxiang Chen; Boyang Hong; Honglin Guo,; Junzhe Wang; Dingwen Yang; Chenyang Liao; Xin Guo; Wei He; Songyang Gao; Lu; Chen; Rui Zheng; Yicheng Zou; Tao Gui; Qi Zhang; Xipeng Qiu; Xuanjing Huang,; Zuxuan Wu; Yu-Gang Jiang

arXiv:2406.04151·cs.AI·June 7, 2024·3 cites

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo,, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu, Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang,, Zuxuan Wu, Yu-Gang Jiang

PDF

Open Access 1 Repo 9 Models 3 Datasets

TL;DR

This paper introduces AgentGym, a comprehensive framework for developing and evolving large language model-based agents across diverse environments, emphasizing self-evolution and broad generalization capabilities.

Contribution

It presents a new environment suite, a scalable evolution method, and a benchmark for training and assessing generalist LLM agents capable of self-evolution.

Findings

01

Evolved agents achieve results comparable to state-of-the-art models.

02

AgentGym enables broad exploration and learning across multiple environments.

03

The framework supports scalable self-evolution of agents beyond initial training data.

Abstract

Building generalist agents that can handle diverse tasks and evolve themselves across different environments is a long-term goal in the AI community. Large language models (LLMs) are considered a promising foundation to build such agents due to their generalized capabilities. Current approaches either have LLM-based agents imitate expert-provided trajectories step-by-step, requiring human supervision, which is hard to scale and limits environmental exploration; or they let agents explore and learn in isolated environments, resulting in specialist agents with limited generalization. In this paper, we take the first step towards building generally-capable LLM-based agents with self-evolution ability. We identify a trinity of ingredients: 1) diverse environments for agent exploration and learning, 2) a trajectory set to equip agents with basic capabilities and prior knowledge, and 3) an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

woooodyy/agentgym
noneOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsSparse Evolutionary Training