The Privacy Onion Effect: Memorization is Relative
Nicholas Carlini, Matthew Jagielski, Chiyuan Zhang, Nicolas Papernot,, Andreas Terzis, Florian Tramer

TL;DR
This paper introduces the Privacy Onion Effect, showing that removing outlier data points from models exposes new layers of data to privacy risks, challenging current defenses and privacy-preserving techniques.
Contribution
It uncovers the Onion Effect of memorization, revealing how privacy vulnerabilities shift layers when outliers are removed, and analyzes its implications for privacy defenses.
Findings
Removing outliers exposes new vulnerable data layers
Current privacy defenses may be ineffective against the Onion Effect
Privacy-preserving techniques like unlearning could inadvertently harm privacy
Abstract
Machine learning models trained on private datasets have been shown to leak their private data. While recent work has found that the average data point is rarely leaked, the outlier samples are frequently subject to memorization and, consequently, privacy leakage. We demonstrate and analyse an Onion Effect of memorization: removing the "layer" of outlier points that are most vulnerable to a privacy attack exposes a new layer of previously-safe points to the same attack. We perform several experiments to study this effect, and understand why it occurs. The existence of this effect has various consequences. For example, it suggests that proposals to defend against memorization without training with rigorous privacy guarantees are unlikely to be effective. Further, it suggests that privacy-enhancing technologies such as machine unlearning could actually harm the privacy of other users.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
