What makes unlearning hard and what to do about it
Kairan Zhao, Meghdad Kurmanji, George-Octavian B\u{a}rbulescu, Eleni, Triantafillou, Peter Triantafillou

TL;DR
This paper investigates the fundamental challenges of machine unlearning, identifying key factors affecting difficulty, and introduces RUM, a meta-algorithm that improves unlearning performance by refining forget sets into homogeneous subsets.
Contribution
It is the first study to analyze characteristics influencing unlearning difficulty and proposes RUM, a novel framework that enhances existing algorithms by refining forget sets.
Findings
Identified two key factors affecting unlearning difficulty.
RUM significantly improves performance of existing unlearning algorithms.
Unlearning behaviors differ on targeted forget sets versus random ones.
Abstract
Machine unlearning is the problem of removing the effect of a subset of training data (the ''forget set'') from a trained model without damaging the model's utility e.g. to comply with users' requests to delete their data, or remove mislabeled, poisoned or otherwise problematic data. With unlearning research still being at its infancy, many fundamental open questions exist: Are there interpretable characteristics of forget sets that substantially affect the difficulty of the problem? How do these characteristics affect different state-of-the-art algorithms? With this paper, we present the first investigation aiming to answer these questions. We identify two key factors affecting unlearning difficulty and the performance of unlearning algorithms. Evaluation on forget sets that isolate these identified factors reveals previously-unknown behaviours of state-of-the-art algorithms that don't…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Mobile Crowdsensing and Crowdsourcing
MethodsSparse Evolutionary Training
