CAP-GAN: Towards Adversarial Robustness with Cycle-consistent Attentional Purification
Mingu Kang, Trung Quang Tran, Seungju Cho, Daeyoung Kim

TL;DR
CAP-GAN is a novel purification model that enhances adversarial robustness by leveraging cycle-consistent learning, guided attention, and knowledge distillation to produce clean-like images and defend against various attack strategies.
Contribution
It introduces a cycle-consistent attentional purification framework that improves adversarial defense by combining pixel and feature consistency with attention and knowledge distillation.
Findings
CAP-GAN outperforms existing defenses on CIFAR-10.
Effective against both black-box and white-box attacks.
Enhances robustness through cycle-consistent learning.
Abstract
Adversarial attack is aimed at fooling the target classifier with imperceptible perturbation. Adversarial examples, which are carefully crafted with a malicious purpose, can lead to erroneous predictions, resulting in catastrophic accidents. To mitigate the effects of adversarial attacks, we propose a novel purification model called CAP-GAN. CAP-GAN takes account of the idea of pixel-level and feature-level consistency to achieve reasonable purification under cycle-consistent learning. Specifically, we utilize the guided attention module and knowledge distillation to convey meaningful information to the purification model. Once a model is fully trained, inputs would be projected into the purification model and transformed into clean-like images. We vary the capacity of the adversary to argue the robustness against various types of attack strategies. On the CIFAR-10 dataset, CAP-GAN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
