R$^3$Mem: Bridging Memory Retention and Retrieval via Reversible Compression
Xiaoqiang Wang, Suyuchen Wang, Yun Zhu, Bang Liu

TL;DR
R$^3$Mem introduces a reversible, hierarchical compression-based memory network that enhances long-term retention and retrieval in language models, achieving state-of-the-art results in long-context tasks.
Contribution
It proposes a novel reversible memory architecture with hierarchical compression for improved information retention and retrieval in language models.
Findings
Achieves state-of-the-art performance in long-context language modeling.
Outperforms traditional memory modules in long-horizon interactions.
Seamlessly integrates with Transformer models via parameter-efficient fine-tuning.
Abstract
Memory plays a key role in enhancing LLMs' performance when deployed to real-world applications. Existing solutions face trade-offs: explicit memory designs based on external storage require complex management and incur storage overhead, while implicit memory designs that store information via parameters struggle with reliable retrieval. In this paper, we propose RMem, a memory network that optimizes both information Retention and Retrieval through Reversible context compression. Specifically, RMem employs virtual memory tokens to compress and encode infinitely long histories, further enhanced by a hierarchical compression strategy that refines information from document- to entity-level for improved assimilation across granularities. For retrieval, RMem employs a reversible architecture, reconstructing raw data by invoking the model backward with compressed information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Topic Modeling · Advanced Image and Video Retrieval Techniques
MethodsMemory Network
