Evaluating of Machine Unlearning: Robustness Verification Without Prior Modifications
Heng Xu, Tianqing Zhu, Wanlei Zhou

TL;DR
This paper introduces a new robustness verification method for machine unlearning that does not require prior sample modifications, enabling more reliable and scalable verification of unlearning in ML models.
Contribution
The paper proposes a novel, optimization-based verification scheme that operates solely on model parameters, supporting larger verification sets without prior modifications.
Findings
Supports verification on larger sample sets
Operates without prior sample modifications
Demonstrates robustness through theoretical and experimental analysis
Abstract
Machine unlearning, a process enabling pre-trained models to remove the influence of specific training samples, has attracted significant attention in recent years. While extensive research has focused on developing efficient unlearning strategies, the critical aspect of unlearning verification has been largely overlooked. Existing verification methods mainly rely on machine learning attack techniques, such as membership inference attacks (MIAs) or backdoor attacks. However, these methods, not being formally designed for verification purposes, exhibit limitations in robustness and only support a small, predefined subset of samples. Moreover, dependence on prepared sample-level modifications of MIAs or backdoor attacks restricts their applicability in Machine Learning as a Service (MLaaS) environments. To address these limitations, we propose a novel robustness verification scheme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
