Internalizing Agency from Reflective Experience
Rui Ge, Yichao Fu, Yuyang Qian, Junda Su, Yiming Zhao, Peng Zhao, Hao Zhang

TL;DR
This paper introduces LEAFE, a framework that enhances autonomous language models' problem-solving ability by internalizing recovery strategies from reflective experience, leading to improved long-horizon interaction performance.
Contribution
LEAFE is the first method to internalize recovery agency from reflective experience, enabling models to better utilize environment feedback for long-term problem solving.
Findings
LEAFE improves Pass@1 by up to 14% over base models.
LEAFE outperforms outcome-driven and experience-based baselines in diverse tasks.
LEAFE enhances the model's ability to recover and explore effectively in complex environments.
Abstract
Large language models are increasingly deployed as autonomous agents that must plan, act, and recover from mistakes through long-horizon interaction with environments that provide rich feedback. However, prevailing outcome-driven post-training methods (e.g., RL with verifiable rewards) primarily optimize final success signals, leaving rich environment feedback underutilized. Consequently, they often lead to distribution sharpening: the policy becomes better at reproducing a narrow set of already-successful behaviors, while failing to improve the feedback-grounded agency needed to expand problem-solving capacity (e.g., Pass@k) in long-horizon settings. To address this, we propose LEAFE (Learning Feedback-Grounded Agency from Reflective Experience), a framework that internalizes recovery agency from reflective experience. Specifically, during exploration, the agent summarizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)
