PRIME: Training Free Proactive Reasoning via Iterative Memory Evolution for User-Centric Agent

Prince Zizhuang Wang; Shuli Jiang

arXiv:2604.07645·cs.AI·April 10, 2026

PRIME: Training Free Proactive Reasoning via Iterative Memory Evolution for User-Centric Agent

Prince Zizhuang Wang, Shuli Jiang

PDF

TL;DR

PRIME introduces a gradient-free, experience-based framework for developing proactive, user-centric agents capable of iterative reasoning and communication in complex tasks, emphasizing cost-efficiency and interpretability.

Contribution

It presents PRIME, a novel gradient-free learning method that leverages structured experience accumulation and retrieval to improve agent performance without expensive training.

Findings

01

PRIME achieves competitive results with gradient-based methods.

02

It offers cost-efficient and interpretable agent training.

03

Experiments demonstrate effectiveness across diverse environments.

Abstract

The development of autonomous tool-use agents for complex, long-horizon tasks in collaboration with human users has become the frontier of agentic research. During multi-turn Human-AI interactions, the dynamic and uncertain nature of user demands poses a significant challenge; agents must not only invoke tools but also iteratively refine their understanding of user intent through effective communication. While recent advances in reinforcement learning offer a path to more capable tool-use agents, existing approaches require expensive training costs and struggle with turn-level credit assignment across extended interaction horizons. To this end, we introduce PRIME (Proactive Reasoning via Iterative Memory Evolution), a gradient-free learning framework that enables continuous agent evolvement through explicit experience accumulation rather than expensive parameter optimization. PRIME…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.