Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled Attention
Sia Huat Tan, Runpei Dong, Kaisheng Ma

TL;DR
MGNet is a novel recurrent attention-based neural network that sequentially focuses on image regions, improving robustness against adversarial attacks and corruptions while reducing computational costs.
Contribution
It introduces a recurrent downsampled attention mechanism for efficient, robust, and interpretable image classification, outperforming baseline models in accuracy and resistance to attacks.
Findings
MGNet improves 4.76% accuracy on corruptions with less computation.
Maintains 44.2% accuracy under PGD attack, outperforming baseline.
Inherently interpretable by explicitly showing focus regions.
Abstract
Most feedforward convolutional neural networks spend roughly the same efforts for each pixel. Yet human visual recognition is an interaction between eye movements and spatial attention, which we will have several glimpses of an object in different regions. Inspired by this observation, we propose an end-to-end trainable Multi-Glimpse Network (MGNet) which aims to tackle the challenges of high computation and the lack of robustness based on recurrent downsampled attention mechanism. Specifically, MGNet sequentially selects task-relevant regions of an image to focus on and then adaptively combines all collected information for the final prediction. MGNet expresses strong resistance against adversarial attacks and common corruptions with less computation. Also, MGNet is inherently more interpretable as it explicitly informs us where it focuses during each iteration. Our experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
