Loading paper

$\delta$-mem: Efficient Online Memory for Large Language Models | Tomesphere

arXiv:2605.12357·cs.AI·May 13, 2026

$\delta$-mem: Efficient Online Memory for Large Language Models

Jingdi Lei, Di Zhang, Junxian Li, Weida Wang, Kaixuan Fan, Xiang Liu, Qihan Liu, Xiaoteng Ma, Baian Chen, Soujanya Poria

1 Repo 2 Models 1 Datasets

TL;DR

$4mem4 is a lightweight online memory mechanism that enhances large language models' ability to utilize historical information efficiently without full fine-tuning.

Contribution

It introduces $4mem, a compact online associative memory that improves long-term information retention in language models with minimal overhead.

Findings

01

$4mem$ improves model scores by 1.10x on average over the backbone.

02

It achieves 1.31x improvement on MemoryAgentBench.

03

Effective memory can be integrated without full fine-tuning or context extension.

Abstract

Large language models increasingly need to accumulate and reuse historical information in long-term assistants and agent systems. Simply expanding the context window is costly and often fails to ensure effective context utilization. We propose $δ$ -mem, a lightweight memory mechanism that augments a frozen full-attention backbone with a compact online state of associative memory. $δ$ -mem compresses past information into a fixed-size state matrix updated by delta-rule learning, and uses its readout to generate low-rank corrections to the backbone's attention computation during generation. With only an $8 \times 8$ online memory state, $δ$ -mem improves the average score to $1.10 \times$ that of the frozen backbone and $1.15 \times$ that of the strongest non- $δ$ -mem memory baseline. It achieves larger gains on memory-heavy benchmarks, reaching $1.31 \times$ on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

declare-lab/delta-Mem
github

Models

Datasets

huaXiaKyrie/delta-mem-qasper-data
dataset· 22 dl
22 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.