Really Unlearned? Verifying Machine Unlearning via Influential Sample Pairs
Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou

TL;DR
This paper introduces a formal verification method called IndirectVerify for confirming successful machine unlearning by using influential sample pairs, improving robustness over existing attack-based verification schemes.
Contribution
The paper proposes a novel formal verification scheme for machine unlearning that employs influential sample pairs and a perturbation-based generation method, enhancing robustness against bypassing.
Findings
Effective verification of unlearning requests using influential sample pairs.
Enhanced robustness of verification compared to existing attack-based methods.
Perturbation-based scheme successfully generates influential sample pairs.
Abstract
Machine unlearning enables pre-trained models to eliminate the effects of partial training samples. Previous research has mainly focused on proposing efficient unlearning strategies. However, the verification of machine unlearning, or in other words, how to guarantee that a sample has been successfully unlearned, has been overlooked for a long time. Existing verification schemes typically rely on machine learning attack techniques, such as backdoor or membership inference attacks. As these techniques are not formally designed for verification, they are easily bypassed when an untrustworthy MLaaS undergoes rapid fine-tuning to merely meet the verification conditions, rather than executing real unlearning. In this paper, we propose a formal verification scheme, IndirectVerify, to determine whether unlearning requests have been successfully executed. We design influential sample pairs: one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Neural Networks and Applications
