Tight Bounds for Machine Unlearning via Differential Privacy
Yiyang Huang, Cl\'ement L. Canonne

TL;DR
This paper establishes tight bounds on the number of data points that can be effectively unlearned by differentially private algorithms, advancing the theoretical understanding of machine unlearning's limits.
Contribution
It provides the first complete characterization of the deletion capacity for DP-based machine unlearning, closing previous bounds gap.
Findings
Tight bounds on unlearning capacity established
Theoretical limits for DP algorithms in unlearning clarified
Implications for privacy-preserving machine learning discussed
Abstract
We consider the formulation of "machine unlearning" of Sekhari, Acharya, Kamath, and Suresh (NeurIPS 2021), which formalizes the so-called "right to be forgotten" by requiring that a trained model, upon request, should be able to "unlearn" a number of points from the training data, as if they had never been included in the first place. Sekhari et al. established some positive and negative results about the number of data points that can be successfully unlearnt by a trained model without impacting the model's accuracy (the "deletion capacity"), showing that machine unlearning could be achieved by using differentially private (DP) algorithms. However, their results left open a gap between upper and lower bounds on the deletion capacity of these algorithms: our work fully closes this gap, obtaining tight bounds on the deletion capacity achievable by DP-based machine unlearning algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
