R^2-Mem: Reflective Experience for Memory Search

Xinyuan Wang; Wenyu Mao; Junkang Wu; Xiang Wang; Xiangnan He

arXiv:2605.13486·cs.CL·May 14, 2026

R^2-Mem: Reflective Experience for Memory Search

Xinyuan Wang, Wenyu Mao, Junkang Wu, Xiang Wang, Xiangnan He

PDF

TL;DR

R^2-Mem introduces a reflective experience framework for memory search that improves accuracy and efficiency of agents by learning from past search trajectories without reinforcement learning.

Contribution

The paper presents R^2-Mem, a novel RL-free, low-cost framework that enhances memory search systems through reflective experience and self-guided improvement.

Findings

01

F1 scores improved by up to 22.6%

02

Token consumption reduced by 12.9%

03

Search iterations decreased by 20.2%

Abstract

Deep search has recently emerged as a promising paradigm for enabling agents to retrieve fine-grained historical information without heavy memory pre-managed. However, existing deep search agents for memory system repeat past error behaviors because they fail to learn from the prior high- and low-quality search trajectories. To address this limitation, we propose R^2-Mem, a reflective experience framework for memory search systems. In the offline stage, a Rubric-guided Evaluator scores low- and high-quality steps in historical trajectories, and a self-Reflection Learner distills the corresponding abstract experience. During the online inference, the retrieved experience will guide future search actions to avoid repeated mistakes and maintain high-quality behaviors. Extensive experiments demonstrate that R^2-Mem consistently improves both effectiveness and efficiency over strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.