Can Neural Network Memorization Be Localized?
Pratyush Maini, Michael C. Mozer, Hanie Sedghi, Zachary C. Lipton, J., Zico Kolter, Chiyuan Zhang

TL;DR
This paper investigates the localization of memorization in neural networks, revealing that it is confined to small neuron sets across various layers, and introduces a new dropout method to control this memorization.
Contribution
The study demonstrates that memorization is localized to specific neurons rather than entire layers and proposes example-tied dropout to mitigate memorization effects.
Findings
Memorization is confined to a small set of neurons or channels.
Most layers are redundant for memorization, often not the final layers.
Example-tied dropout significantly reduces memorization accuracy.
Abstract
Recent efforts at explaining the interplay of memorization and generalization in deep overparametrized networks have posited that neural networks "hard" examples in the final few layers of the model. Memorization refers to the ability to correctly predict on examples of the training set. In this work, we show that rather than being confined to individual layers, memorization is a phenomenon confined to a small set of neurons in various layers of the model. First, via three experimental sources of converging evidence, we find that most layers are redundant for the memorization of examples and the layers that contribute to example memorization are, in general, not the final layers. The three sources are (measuring the contribution to the gradient norms from memorized and clean examples), …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Model Reduction and Neural Networks
MethodsDropout
