PRIME: Training Free Proactive Reasoning via Iterative Memory Evolution for User-Centric Agent
Prince Zizhuang Wang, Shuli Jiang

TL;DR
PRIME introduces a gradient-free, experience-based framework for developing proactive, user-centric agents capable of iterative reasoning and communication in complex tasks, emphasizing cost-efficiency and interpretability.
Contribution
It presents PRIME, a novel gradient-free learning method that leverages structured experience accumulation and retrieval to improve agent performance without expensive training.
Findings
PRIME achieves competitive results with gradient-based methods.
It offers cost-efficient and interpretable agent training.
Experiments demonstrate effectiveness across diverse environments.
Abstract
The development of autonomous tool-use agents for complex, long-horizon tasks in collaboration with human users has become the frontier of agentic research. During multi-turn Human-AI interactions, the dynamic and uncertain nature of user demands poses a significant challenge; agents must not only invoke tools but also iteratively refine their understanding of user intent through effective communication. While recent advances in reinforcement learning offer a path to more capable tool-use agents, existing approaches require expensive training costs and struggle with turn-level credit assignment across extended interaction horizons. To this end, we introduce PRIME (Proactive Reasoning via Iterative Memory Evolution), a gradient-free learning framework that enables continuous agent evolvement through explicit experience accumulation rather than expensive parameter optimization. PRIME…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
