Nonparametric Modern Hopfield Models
Jerry Yao-Chieh Hu, Bo-Yu Chen, Dennis Wu, Feng Ruan, Han Liu

TL;DR
This paper introduces a nonparametric interpretation of modern Hopfield models, leading to efficient sparse variants with theoretical guarantees and practical validation in memory retrieval and learning tasks.
Contribution
It provides a nonparametric perspective on modern Hopfield models, enabling the development of efficient sparse variants with strong theoretical properties.
Findings
Sparse modern Hopfield models have sub-quadratic complexity.
The models retain key properties like connection to transformer attention.
Empirical validation shows effectiveness in retrieval and learning tasks.
Abstract
We present a nonparametric interpretation for deep learning compatible modern Hopfield models and utilize this new perspective to debut efficient variants. Our key contribution stems from interpreting the memory storage and retrieval processes in modern Hopfield models as a nonparametric regression problem subject to a set of query-memory pairs. Interestingly, our framework not only recovers the known results from the original dense modern Hopfield model but also fills the void in the literature regarding efficient modern Hopfield models, by introducing \textit{sparse-structured} modern Hopfield models with sub-quadratic complexity. We establish that this sparse model inherits the appealing theoretical properties of its dense analogue -- connection with transformer attention, fixed point convergence and exponential memory capacity. Additionally, we showcase the versatility of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis
MethodsSparse Evolutionary Training
