ZeroPur: Succinct Training-Free Adversarial Purification

Erhu Liu; Zonglin Yang; Bo Liu; Bin Xiao; and Xiuli Bi

arXiv:2406.03143·cs.CV·May 28, 2025

ZeroPur: Succinct Training-Free Adversarial Purification

Erhu Liu, Zonglin Yang, Bo Liu, Bin Xiao, and Xiuli Bi

PDF

Open Access 1 Repo

TL;DR

ZeroPur is a training-free adversarial purification method that effectively defends against unseen attacks by projecting adversarial images onto the natural image manifold without external models or retraining.

Contribution

ZeroPur introduces a simple, training-free purification technique that relies solely on victim classifiers, avoiding retraining or external generative models, and achieves state-of-the-art robustness.

Findings

01

Achieves state-of-the-art robustness on CIFAR-10, CIFAR-100, and ImageNet-1K.

02

Operates without external models or retraining of classifiers.

03

Demonstrates effectiveness across various classifier architectures.

Abstract

Adversarial purification is a kind of defense technique that can defend against various unseen adversarial attacks without modifying the victim classifier. Existing methods often depend on external generative models or cooperation between auxiliary functions and victim classifiers. However, retraining generative models, auxiliary functions, or victim classifiers relies on the domain of the fine-tuned dataset and is computation-consuming. In this work, we suppose that adversarial images are outliers of the natural image manifold, and the purification process can be considered as returning them to this manifold. Following this assumption, we present a simple adversarial purification method without further training to purify adversarial images, called ZeroPur. ZeroPur contains two steps: given an adversarial example, Guided Shift obtains the shifted embedding of the adversarial example by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fra31/auto-attack
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning