Tight Bounds for Machine Unlearning via Differential Privacy

Yiyang Huang; Cl\'ement L. Canonne

arXiv:2309.00886·cs.LG·September 6, 2023·1 cites

Tight Bounds for Machine Unlearning via Differential Privacy

Yiyang Huang, Cl\'ement L. Canonne

PDF

Open Access

TL;DR

This paper establishes tight bounds on the number of data points that can be effectively unlearned by differentially private algorithms, advancing the theoretical understanding of machine unlearning's limits.

Contribution

It provides the first complete characterization of the deletion capacity for DP-based machine unlearning, closing previous bounds gap.

Findings

01

Tight bounds on unlearning capacity established

02

Theoretical limits for DP algorithms in unlearning clarified

03

Implications for privacy-preserving machine learning discussed

Abstract

We consider the formulation of "machine unlearning" of Sekhari, Acharya, Kamath, and Suresh (NeurIPS 2021), which formalizes the so-called "right to be forgotten" by requiring that a trained model, upon request, should be able to "unlearn" a number of points from the training data, as if they had never been included in the first place. Sekhari et al. established some positive and negative results about the number of data points that can be successfully unlearnt by a trained model without impacting the model's accuracy (the "deletion capacity"), showing that machine unlearning could be achieved by using differentially private (DP) algorithms. However, their results left open a gap between upper and lower bounds on the deletion capacity of these algorithms: our work fully closes this gap, obtaining tight bounds on the deletion capacity achievable by DP-based machine unlearning algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques