Defending Against Adversarial Attacks by Leveraging an Entire GAN

Gokula Krishnan Santhanam; Paulina Grnarova

arXiv:1805.10652·stat.ML·May 29, 2018·24 cites

Defending Against Adversarial Attacks by Leveraging an Entire GAN

Gokula Krishnan Santhanam, Paulina Grnarova

PDF

Open Access 1 Repo

TL;DR

This paper introduces 'cowboy', a GAN-based method that detects and defends against adversarial attacks by leveraging the discriminator and generator to identify and project adversarial samples back onto the data manifold, enhancing robustness.

Contribution

The paper presents a novel GAN-based approach that detects adversarial samples using the discriminator and cleans them via the generator, independent of classifier and attack type.

Findings

01

Discriminator scores are lower for adversarial samples across datasets.

02

Adversarial samples lie outside the learned data manifold.

03

The cleaning method effectively projects samples back onto the data manifold.

Abstract

Recent work has shown that state-of-the-art models are highly vulnerable to adversarial perturbations of the input. We propose cowboy, an approach to detecting and defending against adversarial attacks by using both the discriminator and generator of a GAN trained on the same dataset. We show that the discriminator consistently scores the adversarial samples lower than the real samples across multiple attacks and datasets. We provide empirical evidence that adversarial samples lie outside of the data manifold learned by the GAN. Based on this, we propose a cleaning method which uses both the discriminator and generator of the GAN to project the samples back onto the data manifold. This cleaning procedure is independent of the classifier and type of attack and thus can be deployed in existing systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mnswdhw/DefenseGAN-and-Cowboy-Defense
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729