Descend or Rewind? Stochastic Gradient Descent Unlearning

Siqiao Mu; Diego Klabjan

arXiv:2511.15983·cs.LG·March 2, 2026

Open Access

TL;DR

This paper proves $(\varepsilon, \delta)$ certified unlearning guarantees for the stochastic versions of two gradient descent unlearning algorithms, Descent-to-Delete (D2D) and Rewind-to-Delete (R2D), for strongly convex, convex, and nonconvex loss functions, and compares their empirical performance.

Contribution

It offers the first $(\varepsilon, \delta)$ unlearning guarantees for stochastic D2D and R2D on strongly convex, convex, and nonconvex loss functions, using a novel analysis framework that treats unlearning as a disturbed or biased gradient system.

Findings

01. D2D provides tighter guarantees for strongly convex functions.

02. R2D is more suitable for convex and nonconvex functions.

03. Empirical results highlight the strengths and weaknesses of each algorithm.

Abstract

Machine unlearning algorithms aim to remove the impact of selected training data from a model without the computational expense of retraining from scratch. Two such algorithms are "Descent-to-Delete" (D2D) and "Rewind-to-Delete" (R2D), full-batch gradient descent algorithms that are easy to implement and satisfy provable unlearning guarantees. In particular, the stochastic version of D2D is widely implemented as the "finetuning" unlearning baseline, despite lacking theoretical backing on nonconvex functions. In this work, we prove $(\varepsilon, \delta)$ certified unlearning guarantees for stochastic R2D and D2D for strongly convex, convex, and nonconvex loss functions, by analyzing unlearning through the lens of disturbed or biased gradient systems, which may be contracting, semi-contracting, or expansive, respectively. Our argument relies on optimally coupling the random behavior…
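
The two unlearning strategies named in the abstract can be sketched roughly as follows. This is a minimal illustration based on how D2D and R2D are generally described (D2D keeps descending from the trained model on the retained data, R2D first rewinds to an earlier training checkpoint, and both add noise to the result), not the paper's exact algorithms. The ridge-regression loss, the helper names `grad`, `sgd`, `d2d_unlearn`, and `r2d_unlearn`, and all hyperparameters are assumptions made for the example. In particular, `sigma` is an arbitrary placeholder and is not calibrated to any $(\varepsilon, \delta)$ guarantee.

```python
import numpy as np

def grad(w, X, y, lam=0.1):
    # Gradient of an assumed ridge loss: 0.5 * mean((Xw - y)^2) + 0.5 * lam * ||w||^2.
    return X.T @ (X @ w - y) / len(y) + lam * w

def sgd(w, X, y, steps, rng, lr=0.05, batch=8, checkpoints=None):
    # Plain minibatch SGD; optionally records every iterate for later rewinding.
    w = w.copy()
    for _ in range(steps):
        idx = rng.choice(len(y), size=min(batch, len(y)), replace=False)
        w = w - lr * grad(w, X[idx], y[idx])
        if checkpoints is not None:
            checkpoints.append(w.copy())
    return w

def d2d_unlearn(w_trained, X_keep, y_keep, steps, sigma, rng):
    # "Descend": continue SGD from the trained model on retained data, then add noise.
    w = sgd(w_trained, X_keep, y_keep, steps, rng)
    return w + sigma * rng.standard_normal(w.shape)

def r2d_unlearn(checkpoints, rewind_to, X_keep, y_keep, steps, sigma, rng):
    # "Rewind": restart SGD from an earlier training iterate on retained data, then add noise.
    w = sgd(checkpoints[rewind_to], X_keep, y_keep, steps, rng)
    return w + sigma * rng.standard_normal(w.shape)

# Synthetic data (hypothetical): 100 samples, 3 features.
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.standard_normal(100)

# Train on the full dataset, keeping the initial point and every iterate.
w0 = np.zeros(3)
checkpoints = [w0.copy()]
w_trained = sgd(w0, X, y, steps=200, rng=rng, checkpoints=checkpoints)

# Unlearn the first 5 samples: only the retained data is used from here on.
X_keep, y_keep = X[5:], y[5:]
w_d2d = d2d_unlearn(w_trained, X_keep, y_keep, steps=20, sigma=0.01, rng=rng)
w_r2d = r2d_unlearn(checkpoints, 100, X_keep, y_keep, steps=100, sigma=0.01, rng=rng)
```

The only structural difference between the two functions is the starting point: `d2d_unlearn` starts from the final trained weights, while `r2d_unlearn` starts from a stored intermediate iterate, which requires saving checkpoints during training.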

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data