Can Vision Models Truly Forget? Mirage: Representation-Level Certification of Visual Unlearning

Zhenyu Yu; Yangchen Zeng; Chunlei Meng; Guangzhen Yao; Shuigeng Zhou

arXiv:2605.20282·cs.CV·May 21, 2026

TL;DR

Mirage is a representation-level auditing framework for visual unlearning in vertical federated learning (VFL). It shows that methods which pass output-level forgetting checks often still retain significant class information in their representations.

Contribution

The paper proposes Mirage, a novel diagnostics suite for representation-level certification of unlearning, exposing limitations of current output-level metrics in federated visual unlearning.

Findings

01. Existing methods retain substantial class structure after unlearning.

02. No method simultaneously achieves high utility, output-level forgetting, and representation-level forgetting.

03. Class-sample asymmetry: class information persists at the class level but not at the sample level.

Abstract

Machine unlearning in Vertical Federated Learning (VFL) has attracted growing interest, yet existing methods certify forgetting solely using output-level metrics. We challenge these claims by introducing Mirage, a representation-level auditing framework comprising four complementary diagnostics: Linear Probe Recovery (LPR), Centered Kernel Alignment (CKA), Feature Separability Scoring, and Layer-Wise Recovery Analysis. Through experiments across seven datasets and seven baseline methods following recent VFL unlearning protocols, Mirage reveals three key findings: (i) Forgetting gap: methods that pass output-level certification still retain substantial class structure in their representations, with LPR exceeding the retrained baseline by up to 15.4 points; CKA shows these models remain structurally closer to the original than to the retrained reference, while separability scores indicate…
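Two of the diagnostics named in the abstract, CKA and linear-probe recovery, can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: it uses the standard linear form of CKA, swaps in a least-squares probe as a simple stand-in for whatever probe the authors train, and runs on synthetic features. The names `linear_cka`, `probe_accuracy`, `original`, `unlearned`, and `retrained` are hypothetical, chosen only for illustration.

```python
import numpy as np


def linear_cka(x, y):
    """Linear CKA between two feature matrices of shape (n_samples, dim)."""
    # Center each feature dimension so the score ignores mean offsets.
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)
    cross = np.linalg.norm(y.T @ x, ord="fro") ** 2
    norm_x = np.linalg.norm(x.T @ x, ord="fro")
    norm_y = np.linalg.norm(y.T @ y, ord="fro")
    return float(cross / (norm_x * norm_y))


def probe_accuracy(train_x, train_y, test_x, test_y, num_classes):
    """Accuracy of a least-squares linear probe on frozen features.

    Assumption: a closed-form least-squares fit to one-hot targets stands in
    for the trained linear probe; the paper's probe may differ.
    """
    def add_bias(feats):
        return np.hstack([feats, np.ones((feats.shape[0], 1))])

    targets = np.eye(num_classes)[train_y]
    weights, *_ = np.linalg.lstsq(add_bias(train_x), targets, rcond=None)
    preds = (add_bias(test_x) @ weights).argmax(axis=1)
    return float((preds == test_y).mean())


# Synthetic stand-ins for features of the same inputs under three models.
rng = np.random.default_rng(0)
labels = rng.integers(0, 3, size=300)
class_means = 5.0 * np.eye(3, 16)
original = class_means[labels] + rng.normal(size=(300, 16))
unlearned = original + 0.1 * rng.normal(size=(300, 16))  # barely changed
retrained = rng.normal(size=(300, 16))  # carries no class structure

# Structural similarity of the unlearned model to each reference.
cka_to_original = linear_cka(unlearned, original)
cka_to_retrained = linear_cka(unlearned, retrained)

# Probe accuracy gap over the retrained reference, in percentage points.
split = 200


def probe(feats):
    return probe_accuracy(
        feats[:split], labels[:split], feats[split:], labels[split:], 3
    )


lpr_gap = 100.0 * (probe(unlearned) - probe(retrained))
print(cka_to_original, cka_to_retrained, lpr_gap)
```

In this toy setup the "unlearned" features are a lightly perturbed copy of the original, so they score closer to the original than to the retrained reference under CKA, and the probe recovers class labels from them far more easily than from the retrained features. That is the pattern the abstract describes as a forgetting gap, reproduced here only on synthetic data.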
