Localizing Memorization in SSL Vision Encoders
Wenhao Wang, Adam Dziedzic, Michael Backes, Franziska Boenisch

TL;DR
This paper introduces methods to localize where memorization occurs within SSL vision encoders, revealing that memorization is widespread across layers and units, especially for outlier data, and varies by architecture.
Contribution
The paper proposes novel, task-independent metrics for localizing memorization in SSL encoders at layer and unit levels, applicable to various architectures and datasets.
Findings
Memorization increases with layer depth in SSL encoders.
A significant fraction of units exhibit high memorization, unlike supervised models.
Outlier data points cause higher memorization in layers and units.
Abstract
Recent work on studying memorization in self-supervised learning (SSL) suggests that even though SSL encoders are trained on millions of images, they still memorize individual data points. While effort has been put into characterizing the memorized data and linking encoder memorization to downstream utility, little is known about where the memorization happens inside SSL encoders. To close this gap, we propose two metrics for localizing memorization in SSL encoders on a per-layer (layermem) and per-unit basis (unitmem). Our localization methods are independent of the downstream task, do not require any label information, and can be performed in a forward pass. By localizing memorization in various encoder architectures (convolutional and transformer-based) trained on diverse datasets with contrastive and non-contrastive SSL frameworks, we find that (1) while SSL memorization increases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Image Processing Techniques · Advanced Image and Video Retrieval Techniques · Parallel Computing and Optimization Techniques
MethodsPruning
