Adversarial Purification by Consistency-aware Latent Space Optimization   on Data Manifolds

Shuhai Zhang; Jiahao Yang; Hui Luo; Jie Chen; Li Wang; Feng Liu; Bo; Han; Mingkui Tan

arXiv:2412.08394·cs.LG·December 12, 2024

Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds

Shuhai Zhang, Jiahao Yang, Hui Luo, Jie Chen, Li Wang, Feng Liu, Bo, Han, Mingkui Tan

PDF

Open Access

TL;DR

This paper introduces CMAP, a novel adversarial purification method that optimizes latent space vectors of a generative model to effectively remove adversarial perturbations while preserving data integrity, improving robustness and accuracy.

Contribution

It proposes a consistency model-based purification approach that operates in the latent space, addressing limitations of traditional methods by preserving data structure and semantic information.

Findings

01

Significantly improves robustness against strong adversarial attacks.

02

Maintains high natural accuracy on CIFAR-10 and ImageNet-100.

03

Outperforms existing purification methods in experiments.

Abstract

Deep neural networks (DNNs) are vulnerable to adversarial samples crafted by adding imperceptible perturbations to clean data, potentially leading to incorrect and dangerous predictions. Adversarial purification has been an effective means to improve DNNs robustness by removing these perturbations before feeding the data into the model. However, it faces significant challenges in preserving key structural and semantic information of data, as the imperceptible nature of adversarial perturbations makes it hard to avoid over-correcting, which can destroy important information and degrade model performance. In this paper, we break away from traditional adversarial purification methods by focusing on the clean data manifold. To this end, we reveal that samples generated by a well-trained generative model are close to clean ones but far from adversarial ones. Leveraging this insight, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis

MethodsALIGN