CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Mingkun Zhang, Keping Bi, Wei Chen, Jiafeng Guo, Xueqi Cheng

TL;DR
CLIPure introduces a novel purification method in the multi-modal latent space of CLIP, significantly enhancing adversarial robustness for zero-shot image classification without adversarial training.
Contribution
It is the first to perform purification in CLIP's multi-modal latent space and introduces a non-generative purification variant, improving efficiency and robustness.
Findings
Achieves state-of-the-art robustness on CIFAR-10 and ImageNet.
Improves average robustness by 108% across 13 datasets.
Demonstrates effectiveness without adversarial training.
Abstract
In this paper, we aim to build an adversarially robust zero-shot image classifier. We ground our work on CLIP, a vision-language pre-trained encoder model that can perform zero-shot classification by matching an image with text prompts ``a photo of a <class-name>.''. Purification is the path we choose since it does not require adversarial training on specific attack types and thus can cope with any foreseen attacks. We then formulate purification risk as the KL divergence between the joint distributions of the purification process of denoising the adversarial samples and the attack process of adding perturbations to benign samples, through bidirectional Stochastic Differential Equations (SDEs). The final derived results inspire us to explore purification in the multi-modal latent space of CLIP. We propose two variants for our CLIPure approach: CLIPure-Diff which models the likelihood of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications
MethodsContrastive Language-Image Pre-training
