A Neuro-Inspired Autoencoding Defense Against Adversarial Perturbations

Can Bakiskan; Metehan Cekic; Ahmet Dundar Sezer; Upamanyu Madhow

arXiv:2011.10867·cs.LG·December 23, 2020

A Neuro-Inspired Autoencoding Defense Against Adversarial Perturbations

Can Bakiskan, Metehan Cekic, Ahmet Dundar Sezer, Upamanyu Madhow

PDF

Open Access 1 Repo

TL;DR

This paper proposes a neuro-inspired autoencoding defense mechanism that rejects adversarial perturbations before classification, using biologically motivated encoding and decoding, achieving competitive robustness without adversarial training.

Contribution

It introduces a novel, biologically inspired autoencoding framework for adversarial defense that operates on clean data and scales to larger datasets like Imagenet.

Findings

01

Competitive performance on CIFAR-10 against state-of-the-art defenses

02

Effective scaling demonstrated on Imagenet subset

03

Neuro-inspired approach offers a promising alternative to adversarial training

Abstract

Deep Neural Networks (DNNs) are vulnerable to adversarial attacks: carefully constructed perturbations to an image can seriously impair classification accuracy, while being imperceptible to humans. While there has been a significant amount of research on defending against such attacks, most defenses based on systematic design principles have been defeated by appropriately modified attacks. For a fixed set of data, the most effective current defense is to train the network using adversarially perturbed examples. In this paper, we investigate a radically different, neuro-inspired defense mechanism, starting from the observation that human vision is virtually unaffected by adversarial examples designed for machines. We aim to reject L^inf bounded adversarial perturbations before they reach a classifier DNN, using an encoder with characteristics commonly observed in biological vision:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

canbakiskan/neuro-inspired-defense
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Integrated Circuits and Semiconductor Failure Analysis