Agent-R: Training Language Model Agents to Reflect via Iterative   Self-Training

Siyu Yuan; Zehui Chen; Zhiheng Xi; Junjie Ye; Zhengyin Du; Jiecao Chen

arXiv:2501.11425·cs.AI·March 25, 2025

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Siyu Yuan, Zehui Chen, Zhiheng Xi, Junjie Ye, Zhengyin Du, Jiecao Chen

PDF

Open Access 1 Repo

TL;DR

Agent-R introduces an iterative self-training framework for language model agents that uses MCTS and model-guided critique to improve error recovery and decision-making in interactive environments.

Contribution

This work presents a novel self-reflection and iterative self-training method for language agents, enabling dynamic error correction and improved performance.

Findings

01

Agent-R improves error recovery capabilities in language agents.

02

The framework achieves a +5.59% performance gain over baselines.

03

Iterative refinement enhances both error correction and dataset quality.

Abstract

Large Language Models (LLMs) agents are increasingly pivotal for addressing complex tasks in interactive environments. Existing work mainly focuses on enhancing performance through behavior cloning from stronger experts, yet such approaches often falter in real-world applications, mainly due to the inability to recover from errors. However, step-level critique data is difficult and expensive to collect. Automating and dynamically constructing self-critique datasets is thus crucial to empowering models with intelligent agent capabilities. In this work, we propose an iterative self-training framework, Agent-R, that enables language Agent to Reflect on the fly. Unlike traditional methods that reward or penalize actions based on correctness, Agent-R leverages MCTS to construct training data that recover correct trajectories from erroneous ones. A key challenge of agent reflection lies in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bytedance/agent-r
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques