CLIPure: Purification in Latent Space via CLIP for Adversarially Robust   Zero-Shot Classification

Mingkun Zhang; Keping Bi; Wei Chen; Jiafeng Guo; Xueqi Cheng

arXiv:2502.18176·cs.CV·March 4, 2025

CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

Mingkun Zhang, Keping Bi, Wei Chen, Jiafeng Guo, Xueqi Cheng

PDF

Open Access 1 Repo

TL;DR

CLIPure introduces a novel purification method in the multi-modal latent space of CLIP, significantly enhancing adversarial robustness for zero-shot image classification without adversarial training.

Contribution

It is the first to perform purification in CLIP's multi-modal latent space and introduces a non-generative purification variant, improving efficiency and robustness.

Findings

01

Achieves state-of-the-art robustness on CIFAR-10 and ImageNet.

02

Improves average robustness by 108% across 13 datasets.

03

Demonstrates effectiveness without adversarial training.

Abstract

In this paper, we aim to build an adversarially robust zero-shot image classifier. We ground our work on CLIP, a vision-language pre-trained encoder model that can perform zero-shot classification by matching an image with text prompts ``a photo of a <class-name>.''. Purification is the path we choose since it does not require adversarial training on specific attack types and thus can cope with any foreseen attacks. We then formulate purification risk as the KL divergence between the joint distributions of the purification process of denoising the adversarial samples and the attack process of adding perturbations to benign samples, through bidirectional Stochastic Differential Equations (SDEs). The final derived results inspire us to explore purification in the multi-modal latent space of CLIP. We propose two variants for our CLIPure approach: CLIPure-Diff which models the likelihood of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tmlresearchgroup-cas/clipure
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications

MethodsContrastive Language-Image Pre-training