Larimar: Large Language Models with Episodic Memory Control
Payel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk and, Sarath Swaminathan, Sihui Dai, Aur\'elie Lozano, Georgios Kollias, and Vijil Chenthamarakshan, Ji\v{r}\'i, Navr\'atil, Soham Dan and, Pin-Yu Chen

TL;DR
Larimar introduces a brain-inspired episodic memory architecture for LLMs that enables fast, accurate, and flexible knowledge updates without retraining, significantly improving efficiency and adaptability.
Contribution
Larimar presents a novel, simple, and LLM-agnostic episodic memory system that allows dynamic knowledge updates, selective forgetting, and context length generalization in large language models.
Findings
Achieves comparable accuracy to baselines in fact editing tasks.
Provides 8-10x speed-up in knowledge updating.
Demonstrates effective mechanisms for forgetting and context management.
Abstract
Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tuning. Experimental results on multiple fact editing benchmarks demonstrate that Larimar attains accuracy comparable to most competitive baselines, even in the challenging sequential editing setup, but also excels in speed - yielding speed-ups of 8-10x depending on the base LLM - as well as flexibility due to the proposed architecture being simple, LLM-agnostic, and hence general. We further provide mechanisms for selective fact forgetting, information leakage prevention, and input context length…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection
