RESTOR: Knowledge Recovery in Machine Unlearning

Keivan Rezaei; Khyathi Chandu; Soheil Feizi; Yejin Choi; Faeze Brahman; Abhilasha Ravichander

arXiv:2411.00204·cs.CL·May 27, 2025

RESTOR: Knowledge Recovery in Machine Unlearning

Keivan Rezaei, Khyathi Chandu, Soheil Feizi, Yejin Choi, Faeze Brahman, Abhilasha Ravichander

PDF

Open Access 2 Repos 2 Datasets

TL;DR

RESTOR is a framework that evaluates machine unlearning algorithms by measuring their ability to forget specific data and recover the original knowledge state, revealing insights into their mechanisms and effectiveness.

Contribution

This work introduces RESTOR, a novel evaluation framework for machine unlearning that assesses both forgetting and knowledge recovery, addressing limitations of previous heuristics.

Findings

01

Some algorithms focus only on forgetting, not recovery

02

Localizing unlearning targets improves performance

03

RESTOR uncovers mechanisms of popular unlearning methods

Abstract

Large language models trained on web-scale corpora can memorize undesirable data containing misinformation, copyrighted material, or private or sensitive information. Recently, several machine unlearning algorithms have been proposed to eliminate the effect of such datapoints from trained models -- that is, to approximate a model that had never been trained on these datapoints in the first place. However, evaluating the effectiveness of unlearning algorithms remains an open challenge. Previous work has relied on heuristics -- such as verifying that the model can no longer reproduce the specific information targeted for removal while maintaining accuracy on unrelated test data. These approaches inadequately capture the complete effect of reversing the influence of datapoints on a trained model. In this work, we propose the RESTOR framework for machine unlearning evaluation, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Data Quality and Management