MeMo: Towards Language Models with Associative Memory Mechanisms
Fabio Massimo Zanzotto, Elena Sofia Ruzzetti, Giancarlo A. Xompero, Leonardo Ranaldi, Davide Venditti, Federico Ranaldi, Cristina Giannone, Andrea Favalli, Raniero Romagnoli

TL;DR
This paper introduces MeMo, a new language model architecture that explicitly memorizes text sequences using layered associative memories, enhancing transparency and enabling model editing like forgetting specific texts.
Contribution
Proposes MeMo, a novel architecture for language models that directly memorizes text sequences with layered associative memories, shifting from learning-based memorization.
Findings
MeMo demonstrates strong memorization capabilities in various configurations.
The architecture offers transparency and facilitates model editing such as forgetting.
Experimental results validate the effectiveness of MeMo in memorization tasks.
Abstract
Memorization is a fundamental ability of Transformer-based Large Language Models, achieved through learning. In this paper, we propose a paradigm shift by designing an architecture to memorize text directly, bearing in mind the principle that memorization precedes learning. We introduce MeMo, a novel architecture for language modeling that explicitly memorizes sequences of tokens in layered associative memories. By design, MeMo offers transparency and the possibility of model editing, including forgetting texts. We experimented with the MeMo architecture, showing the memorization power of the one-layer and the multi-layer configurations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
