MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory

Ali Modarressi; Abdullatif K\"oksal; Ayyoob Imani; Mohsen Fayyaz,; Hinrich Sch\"utze

arXiv:2404.11672·cs.CL·April 18, 2025·2 cites

MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory

Ali Modarressi, Abdullatif K\"oksal, Ayyoob Imani, Mohsen Fayyaz,, Hinrich Sch\"utze

PDF

Open Access 1 Repo

TL;DR

MemLLM introduces a structured read-write memory module to enhance LLMs, enabling better memorization, updating, and interpretability, thereby improving performance on knowledge-intensive tasks.

Contribution

This paper presents MemLLM, a novel explicit memory augmentation method that allows dynamic interaction, improving LLMs' factual accuracy and interpretability.

Findings

01

Enhanced performance on knowledge tasks

02

Improved interpretability of LLMs

03

Better handling of rare and updated facts

Abstract

While current large language models (LLMs) perform well on many knowledge-related tasks, they are limited by relying on their parameters as an implicit storage mechanism. As a result, they struggle with memorizing rare events and with updating their memory as facts change over time. In addition, the uninterpretable nature of parametric memory makes it challenging to prevent hallucination. Model editing and augmenting LLMs with parameters specialized for memory are only partial solutions. In this paper, we introduce MemLLM, a novel method of enhancing LLMs by integrating a structured and explicit read-and-write memory module. MemLLM tackles the aforementioned challenges by enabling dynamic interaction with the memory and improving the LLM's capabilities in using stored knowledge. Our experiments indicate that MemLLM enhances the LLM's performance and interpretability, in language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amodaresi/memllm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications