When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Leheng Sheng; Yongtao Zhang; Wenchang Ma; Yaorui Shi; Ting Huang; Xiang Wang; An Zhang; Ke Shen; Tat-Seng Chua

arXiv:2602.10560·cs.CL·February 12, 2026

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Leheng Sheng, Yongtao Zhang, Wenchang Ma, Yaorui Shi, Ting Huang, Xiang Wang, An Zhang, Ke Shen, Tat-Seng Chua

PDF

Open Access

TL;DR

This paper introduces GRU-Mem, a gated recurrent memory model that improves long-context reasoning in large language models by controlling memory updates and exits, leading to better performance and efficiency.

Contribution

The paper proposes a novel gated recurrent memory mechanism with text-controlled gates and reinforcement learning signals to enhance long-context reasoning in LLMs.

Findings

01

GRU-Mem outperforms MemAgent in reasoning tasks.

02

Achieves up to 400% faster inference speed.

03

Demonstrates improved stability and efficiency.

Abstract

While reasoning over long context is crucial for various real-world applications, it remains challenging for large language models (LLMs) as they suffer from performance degradation as the context length grows. Recent work MemAgent has tried to tackle this by processing context chunk-by-chunk in an RNN-like loop and updating a textual memory for final answering. However, this naive recurrent memory update faces two crucial drawbacks: (i) memory can quickly explode because it can update indiscriminately, even on evidence-free chunks; and (ii) the loop lacks an exit mechanism, leading to unnecessary computation after even sufficient evidence is collected. To address these issues, we propose GRU-Mem, which incorporates two text-controlled gates for more stable and efficient long-context reasoning. Specifically, in GRU-Mem, the memory only updates when the update gate is open and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)