What makes unlearning hard and what to do about it

Kairan Zhao; Meghdad Kurmanji; George-Octavian B\u{a}rbulescu; Eleni; Triantafillou; Peter Triantafillou

arXiv:2406.01257·cs.LG·November 1, 2024

What makes unlearning hard and what to do about it

Kairan Zhao, Meghdad Kurmanji, George-Octavian B\u{a}rbulescu, Eleni, Triantafillou, Peter Triantafillou

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the fundamental challenges of machine unlearning, identifying key factors affecting difficulty, and introduces RUM, a meta-algorithm that improves unlearning performance by refining forget sets into homogeneous subsets.

Contribution

It is the first study to analyze characteristics influencing unlearning difficulty and proposes RUM, a novel framework that enhances existing algorithms by refining forget sets.

Findings

01

Identified two key factors affecting unlearning difficulty.

02

RUM significantly improves performance of existing unlearning algorithms.

03

Unlearning behaviors differ on targeted forget sets versus random ones.

Abstract

Machine unlearning is the problem of removing the effect of a subset of training data (the ''forget set'') from a trained model without damaging the model's utility e.g. to comply with users' requests to delete their data, or remove mislabeled, poisoned or otherwise problematic data. With unlearning research still being at its infancy, many fundamental open questions exist: Are there interpretable characteristics of forget sets that substantially affect the difficulty of the problem? How do these characteristics affect different state-of-the-art algorithms? With this paper, we present the first investigation aiming to answer these questions. We identify two key factors affecting unlearning difficulty and the performance of unlearning algorithms. Evaluation on forget sets that isolate these identified factors reveals previously-unknown behaviours of state-of-the-art algorithms that don't…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kairanzhao/rum
pytorchOfficial

Videos

What makes unlearning hard and what to do about it· slideslive

Taxonomy

TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Mobile Crowdsensing and Crowdsourcing

MethodsSparse Evolutionary Training