Benchmarking Unlearning for Vision Transformers
Kairan Zhao, Iurie Luca, Peter Triantafillou

TL;DR
This paper establishes a comprehensive benchmark for machine unlearning in Vision Transformers, comparing various algorithms across datasets and protocols to evaluate their effectiveness and memorization characteristics.
Contribution
It is the first to benchmark MU algorithms on Vision Transformers, analyzing their performance, memorization, and impact of dataset complexity across multiple protocols.
Findings
MU algorithms vary significantly in VT settings
Memorization influences unlearning effectiveness
Benchmark provides a baseline for future research
Abstract
Research in machine unlearning (MU) has gained strong momentum: MU is now widely regarded as a critical capability for building safe and fair AI. In parallel, research into transformer architectures for computer vision tasks has been highly successful: Increasingly, Vision Transformers (VTs) emerge as strong alternatives to CNNs. Yet, MU research for vision tasks has largely centered on CNNs, not VTs. While benchmarking MU efforts have addressed LLMs, diffusion models, and CNNs, none exist for VTs. This work is the first to attempt this, benchmarking MU algorithm performance in different VT families (ViT and Swin-T) and at different capacities. The work employs (i) different datasets, selected to assess the impacts of dataset scale and complexity; (ii) different MU algorithms, selected to represent fundamentally different approaches for MU; and (iii) both single-shot and continual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
