Machine Unlearning
Lucas Bourtoule, Varun Chandrasekaran, Christopher A. Choquette-Choo,, Hengrui Jia, Adelin Travers, Baiwu Zhang, David Lie, Nicolas Papernot

TL;DR
This paper introduces SISA training, a framework that significantly speeds up the process of unlearning data in machine learning models, especially deep neural networks, by strategically partitioning data to reduce computational costs.
Contribution
The paper presents SISA training, a novel data partitioning method that improves unlearning efficiency for deep learning models without sacrificing much accuracy.
Findings
SISA training reduces unlearning time by up to 4.63x on simple tasks.
It achieves a 2.45x speed-up on the SVHN dataset.
Provides a 1.36x speed-up on complex tasks like ImageNet classification.
Abstract
Once users have shared their data online, it is generally difficult for them to revoke access and ask for the data to be deleted. Machine learning (ML) exacerbates this problem because any model trained with said data may have memorized it, putting users at risk of a successful privacy attack exposing their information. Yet, having models unlearn is notoriously difficult. We introduce SISA training, a framework that expedites the unlearning process by strategically limiting the influence of a data point in the training procedure. While our framework is applicable to any learning algorithm, it is designed to achieve the largest improvements for stateful algorithms like stochastic gradient descent for deep neural networks. SISA training reduces the computational overhead associated with unlearning, even in the worst-case setting where unlearning requests are made uniformly across the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Age of Information Optimization
