RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

Zijie Dai; Shiyuan Deng; Sheng Guan; Yizhou Tian; Xin Yao; Xiao Yan; James Cheng

arXiv:2605.16045·cs.CL·May 18, 2026

RecMem: Recurrence-based Memory Consolidation for Efficient and Effective Long-Running LLM Agents

Zijie Dai, Shiyuan Deng, Sheng Guan, Yizhou Tian, Xin Yao, Xiao Yan, James Cheng

PDF

TL;DR

RecMem introduces a recurrence-based memory consolidation method for long-running LLM agents, significantly reducing token costs while maintaining or improving memory accuracy by selectively invoking LLMs based on interaction recurrence.

Contribution

It proposes a novel recurrence-based memory consolidation approach that minimizes token consumption and enhances memory accuracy in long-running LLM agents.

Findings

01

RecMem reduces memory construction token cost by up to 87%.

02

RecMem exceeds the accuracy of three state-of-the-art memory systems.

03

Selective invocation of LLMs based on recurrence improves efficiency.

Abstract

Memory systems often organize user-agent interactions as retrievable external memory and are crucial for long-running agents by overcoming the limited context windows of LLMs. However, existing memory systems invoke LLMs to process every incoming interaction for memory extraction, and such an eager memory consolidation scheme leads to substantial token consumption. To tackle this problem, we propose RecMem by rethinking when memory consolidation should be conducted. RecMem stores incoming interactions in a subconscious memory layer and encode them using lightweight embedding models for retrieval. LLMs are only invoked to extract episodic and semantic memory when sustained recurrence are observed for semantically similar interactions. Such recurrence-based consolidation works because these interactions correspond to a semantic cluster with rich information and thus are worth extraction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.