MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
Ali Modarressi, Abdullatif K\"oksal, Ayyoob Imani, Mohsen Fayyaz,, Hinrich Sch\"utze

TL;DR
MemLLM introduces a structured read-write memory module to enhance LLMs, enabling better memorization, updating, and interpretability, thereby improving performance on knowledge-intensive tasks.
Contribution
This paper presents MemLLM, a novel explicit memory augmentation method that allows dynamic interaction, improving LLMs' factual accuracy and interpretability.
Findings
Enhanced performance on knowledge tasks
Improved interpretability of LLMs
Better handling of rare and updated facts
Abstract
While current large language models (LLMs) perform well on many knowledge-related tasks, they are limited by relying on their parameters as an implicit storage mechanism. As a result, they struggle with memorizing rare events and with updating their memory as facts change over time. In addition, the uninterpretable nature of parametric memory makes it challenging to prevent hallucination. Model editing and augmenting LLMs with parameters specialized for memory are only partial solutions. In this paper, we introduce MemLLM, a novel method of enhancing LLMs by integrating a structured and explicit read-and-write memory module. MemLLM tackles the aforementioned challenges by enabling dynamic interaction with the memory and improving the LLM's capabilities in using stored knowledge. Our experiments indicate that MemLLM enhances the LLM's performance and interpretability, in language…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
