Unlearning via Sparse Representations

Vedant Shah; Frederik Tr\"auble; Ashish Malik; Hugo Larochelle,; Michael Mozer; Sanjeev Arora; Yoshua Bengio; Anirudh Goyal

arXiv:2311.15268·cs.LG·October 14, 2024·1 cites

Unlearning via Sparse Representations

Vedant Shah, Frederik Tr\"auble, Ashish Malik, Hugo Larochelle,, Michael Mozer, Sanjeev Arora, Yoshua Bengio, Anirudh Goyal

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a nearly compute-free, zero-shot unlearning method using discrete representations, effectively erasing specific data from models with minimal performance impact and significantly reduced computational costs.

Contribution

The paper presents a novel unlearning technique based on a discrete representational bottleneck that is nearly compute-free and outperforms existing methods like SCRUB in efficiency.

Findings

01

Effective unlearning on CIFAR-10, CIFAR-100, and LACUNA-100 datasets.

02

Achieves comparable or better performance than SCRUB.

03

Incurs almost no additional computational cost.

Abstract

Machine \emph{unlearning}, which involves erasing knowledge about a \emph{forget set} from a trained model, can prove to be costly and infeasible by existing techniques. We propose a nearly compute-free zero-shot unlearning technique based on a discrete representational bottleneck. We show that the proposed technique efficiently unlearns the forget set and incurs negligible damage to the model's performance on the rest of the data set. We evaluate the proposed technique on the problem of \textit{class unlearning} using three datasets: CIFAR-10, CIFAR-100, and LACUNA-100. We compare the proposed technique to SCRUB, a state-of-the-art approach which uses knowledge distillation for unlearning. Across all three datasets, the proposed technique performs as well as, if not better than SCRUB while incurring almost no computational cost.

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

**Writing**: The paper's logic is clear, and it is easy to follow.

Weaknesses

- **Incremental Novelty**: While the application of concepts from DKVB [1] to the context of machine unlearning is interesting, it may appear as an incremental advance rather than a groundbreaking innovation. Nonetheless, the practical integration of these ideas to address real-world challenges is acknowledged as valuable and can sometimes be sufficient to make significant contributions to the field. (Minor points) - **Comparative Analysis**: The paper could benefit from a broader comparison wi

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

The paper demonstrates strong performance gains, and is computationally inexpensive. The method only requires forward passes since irrelevant key-value pairs are simply pruned unlike previous methods which utilize some form of negative gradients updates or knowledge distillation. The paper is generally an easy read and figures are clear.

Weaknesses

In terms of novelty, I think it’s important to point out that this paper is a direct application of Discrete Key-Value Bottleneck (Trauble 2022). 1. DKVB was proposed for class-incremental learning, and class unlearning is quite literally the inverse task. 2. The original work also shows improvements on CIFAR10/100, the same benchmarks in this paper. 3. DKVB improves class-incremental learning since each class can be learned by disjoint key-value pairs, thus the model updates for learning new

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 2

Strengths

1. Zero Shot Unlearning. 2. Low computational cost 3. Model Specific only applicable to models with Discrete Key Value Bottleneck.

Weaknesses

1. Even though the whole motivation of this approach is to implement a new approach with better computational efficiency no experiments on computational cost are done. This approach is not measured with current approaches of class unlearning in terms of computational efficiency.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Image Enhancement Techniques

MethodsSparse Evolutionary Training · Knowledge Distillation