Can Neural Network Memorization Be Localized?

Pratyush Maini; Michael C. Mozer; Hanie Sedghi; Zachary C. Lipton; J.; Zico Kolter; Chiyuan Zhang

arXiv:2307.09542·cs.LG·July 20, 2023·1 cites

Can Neural Network Memorization Be Localized?

Pratyush Maini, Michael C. Mozer, Hanie Sedghi, Zachary C. Lipton, J., Zico Kolter, Chiyuan Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the localization of memorization in neural networks, revealing that it is confined to small neuron sets across various layers, and introduces a new dropout method to control this memorization.

Contribution

The study demonstrates that memorization is localized to specific neurons rather than entire layers and proposes example-tied dropout to mitigate memorization effects.

Findings

01

Memorization is confined to a small set of neurons or channels.

02

Most layers are redundant for memorization, often not the final layers.

03

Example-tied dropout significantly reduces memorization accuracy.

Abstract

Recent efforts at explaining the interplay of memorization and generalization in deep overparametrized networks have posited that neural networks $memorize$ "hard" examples in the final few layers of the model. Memorization refers to the ability to correctly predict on $atypical$ examples of the training set. In this work, we show that rather than being confined to individual layers, memorization is a phenomenon confined to a small set of neurons in various layers of the model. First, via three experimental sources of converging evidence, we find that most layers are redundant for the memorization of examples and the layers that contribute to example memorization are, in general, not the final layers. The three sources are $gradient accounting$ (measuring the contribution to the gradient norms from memorized and clean examples), $layer rewinding$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pratyushmaini/localizing-memorization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Model Reduction and Neural Networks

MethodsDropout