Larimar: Large Language Models with Episodic Memory Control

Payel Das; Subhajit Chaudhury; Elliot Nelson; Igor Melnyk and; Sarath Swaminathan; Sihui Dai; Aur\'elie Lozano; Georgios Kollias; and Vijil Chenthamarakshan; Ji\v{r}\'i; Navr\'atil; Soham Dan and; Pin-Yu Chen

arXiv:2403.11901·cs.LG·August 23, 2024·2 cites

Larimar: Large Language Models with Episodic Memory Control

Payel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk and, Sarath Swaminathan, Sihui Dai, Aur\'elie Lozano, Georgios Kollias, and Vijil Chenthamarakshan, Ji\v{r}\'i, Navr\'atil, Soham Dan and, Pin-Yu Chen

PDF

Open Access 1 Repo

TL;DR

Larimar introduces a brain-inspired episodic memory architecture for LLMs that enables fast, accurate, and flexible knowledge updates without retraining, significantly improving efficiency and adaptability.

Contribution

Larimar presents a novel, simple, and LLM-agnostic episodic memory system that allows dynamic knowledge updates, selective forgetting, and context length generalization in large language models.

Findings

01

Achieves comparable accuracy to baselines in fact editing tasks.

02

Provides 8-10x speed-up in knowledge updating.

03

Demonstrates effective mechanisms for forgetting and context management.

Abstract

Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tuning. Experimental results on multiple fact editing benchmarks demonstrate that Larimar attains accuracy comparable to most competitive baselines, even in the challenging sequential editing setup, but also excels in speed - yielding speed-ups of 8-10x depending on the base LLM - as well as flexibility due to the proposed architecture being simple, LLM-agnostic, and hence general. We further provide mechanisms for selective fact forgetting, information leakage prevention, and input context length…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ibm/larimar
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection