Can Vision Models Truly Forget? Mirage: Representation-Level Certification of Visual Unlearning
Zhenyu Yu, Yangchen Zeng, Chunlei Meng, Guangzhen Yao, Shuigeng Zhou

TL;DR
Mirage introduces a representation-level auditing framework to accurately assess visual unlearning in federated learning, revealing that existing methods often retain significant class information despite output-level forgetting.
Contribution
The paper proposes Mirage, a novel diagnostics suite for representation-level certification of unlearning, exposing limitations of current output-level metrics in federated visual unlearning.
Findings
Existing methods retain substantial class structure after unlearning.
No method achieves high utility, output-level, and representation-level forgetting simultaneously.
Class-sample asymmetry shows persistent class information at class level but not at sample level.
Abstract
Machine unlearning in Vertical Federated Learning (VFL) has attracted growing interest, yet existing methods certify forgetting solely using output-level metrics. We challenge these claims by introducing Mirage, a representation-level auditing framework comprising four complementary diagnostics: Linear Probe Recovery (LPR), Centered Kernel Alignment (CKA), Feature Separability Scoring, and Layer-Wise Recovery Analysis. Through experiments across seven datasets and seven baseline methods following recent VFL unlearning protocols, Mirage reveals three key findings: (i) Forgetting gap: methods that pass output-level certification still retain substantial class structure in their representations, with LPR exceeding the retrained baseline by up to 15.4 points; CKA shows these models remain structurally closer to the original than to the retrained reference, while separability scores indicate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
