Factual recall in linear associative memories: sharp asymptotics and mechanistic insights
Alessio Giorlandino, Sebastian Goldt, Antoine Maillard

TL;DR
This paper precisely characterizes the storage capacity limits of linear associative memories for factual recall, revealing how optimal solutions outperform naive learning by raising correct scores above competition thresholds.
Contribution
It introduces a decoupled model equivalent to the original, providing sharp capacity limits and mechanistic insights into optimal solutions in linear associative memories.
Findings
Capacity up to p_c log p_c / d^2 = 1/2 associations
Decoupled model matches original in capacity and spectra
Optimal solutions raise correct scores above competition threshold
Abstract
Large language models demonstrate remarkable ability in factual recall, yet the fundamental limits of storing and retrieving input--output associations with neural networks remain unclear. We study these limits in a minimal setting: a linear associative memory that maps input embeddings in to their corresponding~-dimensional targets via a single layer, requiring each mapped input to be well separated from all other targets. Unlike in supervised classification, this strict separation induces~ constraints per association and produces strong correlations between constraints that make a direct characterisation of the storage capacity difficult. Here, we provide a precise characterisation of this capacity in the following way. We first introduce a decoupled model in which each input has its own independent set of competing outputs, and provide numerical and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
