Agentic Critical Training
Weize Liu, Minghui Liu, Sy-Tuyen Ho, Souradip Chakraborty, Xiyao Wang, Furong Huang

TL;DR
Agentic Critical Training (ACT) is a reinforcement learning approach that enhances large language model agents by training them to autonomously identify better actions, leading to improved performance and reasoning capabilities.
Contribution
The paper introduces ACT, a novel reinforcement learning paradigm that promotes autonomous reasoning about action quality in LLM agents, surpassing imitation and reflection-based methods.
Findings
Improves agent performance by an average of 5.07 points over imitation learning.
Enhances out-of-distribution generalization and reasoning without specialized training data.
Demonstrates advantages over knowledge distillation approaches.
Abstract
Training large language models (LLMs) as autonomous agents often begins with imitation learning, but it only teaches agents what to do without understanding why: agents never contrast successful actions against suboptimal alternatives and thus lack awareness of action quality. Recent approaches attempt to address this by introducing self-reflection supervision derived from contrasts between expert and alternative actions. However, the training paradigm fundamentally remains imitation learning: the model imitates pre-constructed reflection text rather than learning to reason autonomously. We propose Agentic Critical Training (ACT), a reinforcement learning paradigm that trains agents to identify the better action among alternatives. By rewarding whether the model's judgment is correct, ACT drives the model to autonomously develop reasoning about action quality, producing genuine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Multi-Agent Systems and Negotiation
