TL;DR
HeLa-Mem introduces a bio-inspired memory system for LLMs that models associative memory as a dynamic graph with Hebbian learning, improving long-term coherence and efficiency.
Contribution
It presents a novel memory architecture combining graph-based associative memory with Hebbian learning, inspired by cognitive neuroscience, for enhanced LLM long-term memory.
Findings
HeLa-Mem outperforms existing methods on LoCoMo across four question categories.
Uses fewer context tokens to achieve superior performance.
Employs a dual-level memory organization with episodic and semantic components.
Abstract
Long-term memory is a critical challenge for Large Language Model agents, as fixed context windows cannot preserve coherence across extended interactions. Existing memory systems represent conversation history as unstructured embedding vectors, retrieving information through semantic similarity. This paradigm fails to capture the associative structure of human memory, wherein related experiences progressively strengthen interconnections through repeated co-activation. Inspired by cognitive neuroscience, we identify three mechanisms central to biological memory: association, consolidation, and spreading activation, which remain largely absent in current research. To bridge this gap, we propose HeLa-Mem, a bio-inspired memory architecture that models memory as a dynamic graph with Hebbian learning dynamics. HeLa-Mem employs a dual-level organization: (1) an episodic memory graph that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
