Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

Zhiyuan Fan; Wenwei Jin; Feng Zhang; Bin Li; Yihong Dong; Yao Hu; Jiawei Li

arXiv:2605.10663·cs.AI·May 12, 2026

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

Zhiyuan Fan, Wenwei Jin, Feng Zhang, Bin Li, Yihong Dong, Yao Hu, Jiawei Li

PDF

TL;DR

Evolving-RL is a unified framework that enhances self-evolving agents by jointly optimizing experience extraction and utilization, significantly improving out-of-distribution task performance in large language models.

Contribution

It introduces a co-evolutionary approach to optimize experience extraction and reuse within agents, addressing limitations of prior isolated or system-level methods.

Findings

01

Achieves up to 98.7% improvement on ALFWorld unseen tasks.

02

Improves performance on Mind2Web by 35.8%.

03

Functions as an experience-augmented RL algorithm with strong results.

Abstract

Experience-driven self-evolving agents aim to overcome the static nature of large language models by distilling reusable experience from past interactions, thus enabling adaptation to novel tasks at deployment time. This process places substantial demands on the foundation model's capacities for abstraction, generalization, and in-context learning. However, most existing studies focus primarily on system-level design choices, such as how experience is represented and managed, neglecting the inherent capabilities of the underlying model. While some recent works have started to optimize the experience utilization stage via reinforcement learning, they still fail to treat self-evolution as a unified process to be jointly optimized. To this end, we propose Evolving-RL, an efficient algorithmic framework that jointly improves the experience extraction and utilization capabilities required…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.