AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Zhiheng Xi, Yiwen Ding, Wenxiang Chen, Boyang Hong, Honglin Guo,, Junzhe Wang, Dingwen Yang, Chenyang Liao, Xin Guo, Wei He, Songyang Gao, Lu, Chen, Rui Zheng, Yicheng Zou, Tao Gui, Qi Zhang, Xipeng Qiu, Xuanjing Huang,, Zuxuan Wu, Yu-Gang Jiang

TL;DR
This paper introduces AgentGym, a comprehensive framework for developing and evolving large language model-based agents across diverse environments, emphasizing self-evolution and broad generalization capabilities.
Contribution
It presents a new environment suite, a scalable evolution method, and a benchmark for training and assessing generalist LLM agents capable of self-evolution.
Findings
Evolved agents achieve results comparable to state-of-the-art models.
AgentGym enables broad exploration and learning across multiple environments.
The framework supports scalable self-evolution of agents beyond initial training data.
Abstract
Building generalist agents that can handle diverse tasks and evolve themselves across different environments is a long-term goal in the AI community. Large language models (LLMs) are considered a promising foundation to build such agents due to their generalized capabilities. Current approaches either have LLM-based agents imitate expert-provided trajectories step-by-step, requiring human supervision, which is hard to scale and limits environmental exploration; or they let agents explore and learn in isolated environments, resulting in specialist agents with limited generalization. In this paper, we take the first step towards building generally-capable LLM-based agents with self-evolution ability. We identify a trinity of ingredients: 1) diverse environments for agent exploration and learning, 2) a trajectory set to equip agents with basic capabilities and prior knowledge, and 3) an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AgentGym/AgentEvol-7Bmodel· 1.0k dl· ♡ 71.0k dl♡ 7
- 🤗RichardErkhov/AgentGym_-_AgentEvol-7B-ggufmodel· 226 dl226 dl
- 🤗Alphatao/Affine-08990213model· 5 dl5 dl
- 🤗Alphatao/Affine-5647568model· 4 dl4 dl
- 🤗ATL-Machine/testmodelmodel
- 🤗olympus-ai/Affine-agent07-finetunemodel
- 🤗aiseosae/Affine-color3model
- 🤗aiseosae/Affine-5HSp1dWtGppxvnsRvDYsWMwWMihzZbftwUU12LGAfwhnECdpmodel
- 🤗aiseosae/Affine-1-5HSp1dWtGppxvnsRvDYsWMwWMihzZbftwUU12LGAfwhnECdpmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsSparse Evolutionary Training
