Dense Associative Memory is Robust to Adversarial Inputs
Dmitry Krotov, John J Hopfield

TL;DR
This paper demonstrates that Dense Associative Memory models with higher order interactions are more robust to adversarial inputs and produce semantically meaningful minima, aligning more closely with human perception than traditional DNNs.
Contribution
The study shows that higher order energy functions in DAM models eliminate rubbish minima and resist adversarial attacks, improving robustness and perceptual alignment.
Findings
Higher order DAM models have semantically meaningful minima.
Models with higher order interactions resist transfer of adversarial images.
DAM models with higher order interactions are closer to human perception.
Abstract
Deep neural networks (DNN) trained in a supervised way suffer from two known problems. First, the minima of the objective function used in learning correspond to data points (also known as rubbish examples or fooling images) that lack semantic similarity with the training data. Second, a clean input can be changed by a small, and often imperceptible for human vision, perturbation, so that the resulting deformed input is misclassified by the network. These findings emphasize the differences between the ways DNN and humans classify patterns, and raise a question of designing learning algorithms that more accurately mimic human perception compared to the existing methods. Our paper examines these questions within the framework of Dense Associative Memory (DAM) models. These models are defined by the energy function, with higher order (higher than quadratic) interactions between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
