Overparameterized Neural Networks Implement Associative Memory

Adityanarayanan Radhakrishnan; Mikhail Belkin; Caroline Uhler

arXiv:1909.12362 · cs.LG · May 25, 2022

TL;DR

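This paper shows that overparameterized neural networks trained with standard optimization methods act as associative memories for real-valued data: autoencoders store training samples as attractors, sequence encoders store sequences of examples, and iterating the learned map retrieves them. The proofs cover autoencoders trained on a single example and the greater efficiency of sequence encoding.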

Contribution

It shows, empirically and theoretically, that standard overparameterized neural networks trained with standard optimization methods function as associative memory systems for real-valued data.

Findings

01. Overparameterized autoencoders store training samples as attractors, so iterating the learned map recovers them.
02. Sequence encoding is a more efficient memory mechanism than autoencoding.
03. An autoencoder trained on a single example provably stores that example as an attractor.
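
To make the word "attractor" in these findings concrete, the conditions can be written out as follows. This is a sketch using the standard dynamical-systems criterion, not a restatement of the paper's theorems: the symbols f (the learned map), J_f (its Jacobian), ρ (spectral radius), and the cyclic ordering of the sequence are notation assumed here for illustration.

```latex
% Autoencoding: each training example x^{(i)} is a fixed point of the learned map f,
% and a sufficient condition for it to be an attractor is a Jacobian with spectral radius below 1.
\[
  f\bigl(x^{(i)}\bigr) = x^{(i)},
  \qquad
  \rho\Bigl(J_f\bigl(x^{(i)}\bigr)\Bigr) < 1,
  \qquad i = 1, \dots, n .
\]

% Sequence encoding (assumed cyclic here): f maps each example to the next one,
% so x^{(1)} is a fixed point of the n-fold composition of f with itself.
\[
  f\bigl(x^{(i)}\bigr) = x^{(i+1)} \quad (1 \le i < n),
  \qquad
  f\bigl(x^{(n)}\bigr) = x^{(1)} .
\]

% By the chain rule, stability of the cycle depends on the product of the Jacobians along it.
\[
  \rho\Bigl(J_f\bigl(x^{(n)}\bigr)\, J_f\bigl(x^{(n-1)}\bigr) \cdots J_f\bigl(x^{(1)}\bigr)\Bigr) < 1 .
\]
```

Under these conditions, iterating f from a point close enough to a stored example converges to that example, or to the stored cycle in the sequence case.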

Abstract

Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. Empirically, we show that: (1) overparameterized autoencoders store training samples as attractors, and thus, iterating the learned map leads to sample recovery; (2) the same mechanism allows for encoding sequences of examples, and serves as an even more efficient mechanism for memory than autoencoding. Theoretically, we prove that when trained on a single example, autoencoders store the example as an attractor. Lastly, by treating a sequence encoder as a composition of maps, we prove that sequence encoding provides a more efficient mechanism for memory than…
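
The retrieval mechanism described in the abstract (iterate the learned map until it settles on a stored sample) can be illustrated numerically. The sketch below is not the authors' code and does not train a network: `toy_map` is a hypothetical stand-in for a trained autoencoder, built so that `x_star` is a fixed point with a contractive Jacobian. The helper names `numerical_jacobian`, `is_attractor`, and `iterate` are likewise assumptions made for this example.

```python
import numpy as np


def numerical_jacobian(f, x, eps=1e-6):
    """Central-difference estimate of the Jacobian of f at x."""
    x = np.asarray(x, dtype=float)
    cols = []
    for i in range(x.size):
        step = np.zeros(x.size)
        step[i] = eps
        cols.append((f(x + step) - f(x - step)) / (2 * eps))
    # Column i holds the partial derivative of f with respect to x[i].
    return np.stack(cols, axis=1)


def is_attractor(f, x, tol=1e-6):
    """Sufficient check: x is a fixed point and the Jacobian's spectral radius is < 1."""
    x = np.asarray(x, dtype=float)
    is_fixed = np.linalg.norm(f(x) - x) < tol
    radius = np.max(np.abs(np.linalg.eigvals(numerical_jacobian(f, x))))
    return bool(is_fixed and radius < 1.0)


def iterate(f, x0, steps=100):
    """Apply f repeatedly, as in retrieval from an associative memory."""
    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        x = f(x)
    return x


# Hypothetical stand-in for a trained autoencoder (assumption, not from the paper):
# x_star plays the role of a stored training sample.
rng = np.random.default_rng(0)
x_star = rng.normal(size=4)
A = rng.normal(size=(4, 4))
A *= 0.5 / np.linalg.norm(A, 2)  # rescale so the spectral norm of A is 0.5


def toy_map(z):
    # Fixed point at x_star; the Jacobian there equals A because tanh'(0) = 1.
    return x_star + np.tanh(A @ (z - x_star))


start = x_star + 0.5 * rng.normal(size=4)  # a corrupted version of the stored sample
recovered = iterate(toy_map, start)
print("attractor:", is_attractor(toy_map, x_star))
print("recovery error:", np.linalg.norm(recovered - x_star))
```

To run the same check on a real model, `toy_map` would be replaced by the trained network's forward pass and `x_star` by a training example. Whether the check passes then depends on the architecture and training, which is what the paper studies.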

Code & Models

Repositories

uhlerlab/neural_networks_associative_memory (official)