Machine Unlearning under Overparameterization
Jacob L. Block, Aryan Mokhtari, Sanjay Shakkottai

TL;DR
This paper introduces a new framework for machine unlearning in overparameterized models, defining unlearning as finding the minimum-complexity interpolator and developing algorithms that outperform existing methods.
Contribution
The paper proposes a novel unlearning framework suitable for overparameterized models, with new algorithms based on gradient orthogonality and theoretical guarantees.
Findings
Outperforms existing unlearning baselines in experiments
Provides exact and approximate unlearning guarantees
Defines unlearning as minimum-complexity interpolation
Abstract
Machine unlearning algorithms aim to remove the influence of specific training samples, ideally recovering the model that would have resulted from training on the remaining data alone. We study unlearning in the overparameterized setting, where many models interpolate the data, and defining the solution as any loss minimizer over the retained setas in prior work in the underparameterized settingis inadequate, since the original model may already interpolate the retained data and satisfy this condition. In this regime, loss gradients vanish, rendering prior methods based on gradient perturbations ineffective, motivating both new unlearning definitions and algorithms. For this setting, we define the unlearning solution as the minimum-complexity interpolator over the retained data and propose a new algorithmic framework that only requires access to model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Data Classification · Advanced Neural Network Applications
