Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds
Shuhai Zhang, Jiahao Yang, Hui Luo, Jie Chen, Li Wang, Feng Liu, Bo, Han, Mingkui Tan

TL;DR
This paper introduces CMAP, a novel adversarial purification method that optimizes latent space vectors of a generative model to effectively remove adversarial perturbations while preserving data integrity, improving robustness and accuracy.
Contribution
It proposes a consistency model-based purification approach that operates in the latent space, addressing limitations of traditional methods by preserving data structure and semantic information.
Findings
Significantly improves robustness against strong adversarial attacks.
Maintains high natural accuracy on CIFAR-10 and ImageNet-100.
Outperforms existing purification methods in experiments.
Abstract
Deep neural networks (DNNs) are vulnerable to adversarial samples crafted by adding imperceptible perturbations to clean data, potentially leading to incorrect and dangerous predictions. Adversarial purification has been an effective means to improve DNNs robustness by removing these perturbations before feeding the data into the model. However, it faces significant challenges in preserving key structural and semantic information of data, as the imperceptible nature of adversarial perturbations makes it hard to avoid over-correcting, which can destroy important information and degrade model performance. In this paper, we break away from traditional adversarial purification methods by focusing on the clean data manifold. To this end, we reveal that samples generated by a well-trained generative model are close to clean ones but far from adversarial ones. Leveraging this insight, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis
MethodsALIGN
