Erasing CLIP Memories: Non-Destructive, Data-Free Zero-Shot class Unlearning in CLIP Models

Ashish Mishra; Tarun Kumar; Gyanaranjan Nayak; Arpit Shah; Suparna Bhattacharya; Martin Foltin

arXiv:2512.14137·cs.CV·December 17, 2025

Erasing CLIP Memories: Non-Destructive, Data-Free Zero-Shot class Unlearning in CLIP Models

Ashish Mishra, Tarun Kumar, Gyanaranjan Nayak, Arpit Shah, Suparna Bhattacharya, Martin Foltin

PDF

Open Access

TL;DR

This paper presents a non-destructive, data-free method for unlearning specific classes in CLIP models by nullspace projection, effectively erasing class information without retraining or data access.

Contribution

The authors introduce a closed-form, nullspace projection technique for selective class unlearning in CLIP, avoiding retraining and data requirements, and enabling precise model decontamination.

Findings

01

Significant reduction in target class performance after unlearning

02

Method preserves overall model knowledge and multimodal capabilities

03

Partial projection balances unlearning effectiveness and knowledge retention

Abstract

We introduce a novel, closed-form approach for selective unlearning in multimodal models, specifically targeting pretrained models such as CLIP. Our method leverages nullspace projection to erase the target class information embedded in the final projection layer, without requiring any retraining or the use of images from the forget set. By computing an orthonormal basis for the subspace spanned by target text embeddings and projecting these directions, we dramatically reduce the alignment between image features and undesired classes. Unlike traditional unlearning techniques that rely on iterative fine-tuning and extensive data curation, our approach is both computationally efficient and surgically precise. This leads to a pronounced drop in zero-shot performance for the target classes while preserving the overall multimodal knowledge of the model. Our experiments demonstrate that even…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning