TL;DR
This paper introduces a neural network with a human-inspired sequential attention mechanism that enhances adversarial robustness and reveals new types of adversarial examples with salient, recognizable structures.
Contribution
The paper presents a novel attention-augmented neural network that improves adversarial robustness and demonstrates a dynamic 'computational race' between attack and defense strategies.
Findings
Attention improves ImageNet accuracy under attacks
Varying attention steps enhances defense capabilities
Adversarial examples can contain recognizable, salient structures
Abstract
In this paper we propose to augment a modern neural-network architecture with an attention model inspired by human perception. Specifically, we adversarially train and analyze a neural model incorporating a human inspired, visual attention component that is guided by a recurrent top-down sequential process. Our experimental evaluation uncovers several notable findings about the robustness and behavior of this new model. First, introducing attention to the model significantly improves adversarial robustness resulting in state-of-the-art ImageNet accuracies under a wide range of random targeted attack strengths. Second, we show that by varying the number of attention steps (glances/fixations) for which the model is unrolled, we are able to make its defense capabilities stronger, even in light of stronger attacks --- resulting in a "computational race" between the attacker and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Towards Robust Image Classification Using Sequential Attention Models· youtube
