Rewind-to-Delete: Certified Machine Unlearning for Nonconvex Functions
Siqiao Mu, Diego Klabjan

TL;DR
This paper introduces a novel certified machine unlearning algorithm for nonconvex models that rewinds training to an earlier state before removing data, providing strong privacy guarantees and practical efficiency.
Contribution
It presents the first first-order, black-box unlearning method applicable to general nonconvex functions with theoretical guarantees and improved empirical performance.
Findings
Proves $(psilon, elta)$ certified unlearning guarantees.
Establishes generalization bounds under Polyak-Lojasiewicz condition.
Demonstrates superior empirical performance over existing methods.
Abstract
Machine unlearning algorithms aim to efficiently remove data from a model without retraining it from scratch, in order to remove corrupted or outdated data or respect a user's ``right to be forgotten." Certified machine unlearning is a strong theoretical guarantee based on differential privacy that quantifies the extent to which an algorithm erases data from the model weights. In contrast to existing works in certified unlearning for convex or strongly convex loss functions, or nonconvex objectives with limiting assumptions, we propose the first, first-order, black-box (i.e., can be applied to models pretrained with vanilla gradient descent) algorithm for unlearning on general nonconvex loss functions, which unlearns by ``rewinding" to an earlier step during the learning process before performing gradient descent on the loss function of the retained data points. We prove $(\epsilon,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Control Systems Optimization · Fault Detection and Control Systems · Control Systems and Identification
MethodsFocus
