A Neuro-Inspired Autoencoding Defense Against Adversarial Perturbations
Can Bakiskan, Metehan Cekic, Ahmet Dundar Sezer, Upamanyu Madhow

TL;DR
This paper proposes a neuro-inspired autoencoding defense mechanism that rejects adversarial perturbations before classification, using biologically motivated encoding and decoding, achieving competitive robustness without adversarial training.
Contribution
It introduces a novel, biologically inspired autoencoding framework for adversarial defense that operates on clean data and scales to larger datasets like Imagenet.
Findings
Competitive performance on CIFAR-10 against state-of-the-art defenses
Effective scaling demonstrated on Imagenet subset
Neuro-inspired approach offers a promising alternative to adversarial training
Abstract
Deep Neural Networks (DNNs) are vulnerable to adversarial attacks: carefully constructed perturbations to an image can seriously impair classification accuracy, while being imperceptible to humans. While there has been a significant amount of research on defending against such attacks, most defenses based on systematic design principles have been defeated by appropriately modified attacks. For a fixed set of data, the most effective current defense is to train the network using adversarially perturbed examples. In this paper, we investigate a radically different, neuro-inspired defense mechanism, starting from the observation that human vision is virtually unaffected by adversarial examples designed for machines. We aim to reject L^inf bounded adversarial perturbations before they reach a classifier DNN, using an encoder with characteristics commonly observed in biological vision:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Integrated Circuits and Semiconductor Failure Analysis
