DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal
Jiaeli Shi, Najah Ghalyan, Kostis Gourgoulias, John Buford, Sean Moran

TL;DR
DeepClean introduces a computationally efficient machine unlearning method that uses the Fisher Information Matrix diagonal to selectively forget sensitive training data, enhancing privacy without full retraining.
Contribution
We propose a novel lightweight unlearning algorithm leveraging the Fisher Information Matrix diagonal for efficient privacy-preserving data removal.
Findings
Successfully forgets random data subsets across neural networks
Achieves effective privacy protection with reduced computational cost
Provides an interpretable and practical unlearning approach
Abstract
Machine learning models trained on sensitive or private data can inadvertently memorize and leak that information. Machine unlearning seeks to retroactively remove such details from model weights to protect privacy. We contribute a lightweight unlearning algorithm that leverages the Fisher Information Matrix (FIM) for selective forgetting. Prior work in this area requires full retraining or large matrix inversions, which are computationally expensive. Our key insight is that the diagonal elements of the FIM, which measure the sensitivity of log-likelihood to changes in weights, contain sufficient information for effective forgetting. Specifically, we compute the FIM diagonal over two subsets -- the data to retain and forget -- for all trainable weights. This diagonal representation approximates the complete FIM while dramatically reducing computation. We then use it to selectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques
