Revisiting Machine Unlearning with Dimensional Alignment

Seonguk Seo; Dongwan Kim; Bohyung Han

arXiv:2407.17710·cs.LG·December 24, 2024

Revisiting Machine Unlearning with Dimensional Alignment

Seonguk Seo, Dongwan Kim, Bohyung Han

PDF

TL;DR

This paper introduces a novel framework for machine unlearning that uses dimensional alignment to improve stability and effectiveness, addressing flaws in existing evaluation metrics and ensuring models forget specific data while retaining overall knowledge.

Contribution

The paper proposes a new unlearning framework based on eigenspace alignment, a novel evaluation metric called dimensional alignment, and improved training schemes for stable, effective unlearning.

Findings

01

The proposed method effectively removes information from the forget set.

02

It preserves knowledge from the retain set.

03

New evaluation tools better reflect unlearning goals.

Abstract

Machine unlearning, an emerging research topic focusing on compliance with data privacy regulations, enables trained models to remove the information learned from specific data. While many existing methods indirectly address this issue by intentionally injecting incorrect supervisions, they can drastically and unpredictably alter the decision boundaries and feature spaces, leading to training instability and undesired side effects. To fundamentally approach this task, we first analyze the changes in latent feature spaces between original and retrained models, and observe that the feature representations of samples not involved in training are closely aligned with the feature manifolds of previously seen samples in training. Based on these findings, we introduce a novel evaluation metric for machine unlearning, coined dimensional alignment, which measures the alignment between the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training