DeepClean: Machine Unlearning on the Cheap by Resetting Privacy   Sensitive Weights using the Fisher Diagonal

Jiaeli Shi; Najah Ghalyan; Kostis Gourgoulias; John Buford; Sean Moran

arXiv:2311.10448·cs.LG·April 29, 2024·2 cites

DeepClean: Machine Unlearning on the Cheap by Resetting Privacy Sensitive Weights using the Fisher Diagonal

Jiaeli Shi, Najah Ghalyan, Kostis Gourgoulias, John Buford, Sean Moran

PDF

Open Access

TL;DR

DeepClean introduces a computationally efficient machine unlearning method that uses the Fisher Information Matrix diagonal to selectively forget sensitive training data, enhancing privacy without full retraining.

Contribution

We propose a novel lightweight unlearning algorithm leveraging the Fisher Information Matrix diagonal for efficient privacy-preserving data removal.

Findings

01

Successfully forgets random data subsets across neural networks

02

Achieves effective privacy protection with reduced computational cost

03

Provides an interpretable and practical unlearning approach

Abstract

Machine learning models trained on sensitive or private data can inadvertently memorize and leak that information. Machine unlearning seeks to retroactively remove such details from model weights to protect privacy. We contribute a lightweight unlearning algorithm that leverages the Fisher Information Matrix (FIM) for selective forgetting. Prior work in this area requires full retraining or large matrix inversions, which are computationally expensive. Our key insight is that the diagonal elements of the FIM, which measure the sensitivity of log-likelihood to changes in weights, contain sufficient information for effective forgetting. Specifically, we compute the FIM diagonal over two subsets -- the data to retain and forget -- for all trainable weights. This diagonal representation approximates the complete FIM while dramatically reducing computation. We then use it to selectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques