GAttANet: Global attention agreement for convolutional neural networks
Rufin VanRullen, Andrea Alamia

TL;DR
GAttANet introduces a brain-inspired global attention mechanism that enhances convolutional neural networks by pooling spatial key-query vectors into a global attention query, modulating activations based on agreement, and improving accuracy across various architectures and datasets.
Contribution
This paper presents a novel global attention system inspired by biological brains, integrated into CNNs to improve performance with minimal additional parameters.
Findings
Improves accuracy on CIFAR10, CIFAR100, and ImageNet-1k datasets.
Effective across different CNN architectures, including ResNet50.
Requires relatively few additional parameters.
Abstract
Transformer attention architectures, similar to those developed for natural language processing, have recently proved efficient also in vision, either in conjunction with or as a replacement for convolutional layers. Typically, visual attention is inserted in the network architecture as a (series of) feedforward self-attention module(s), with mutual key-query agreement as the main selection and routing operation. However efficient, this strategy is only vaguely compatible with the way that attention is implemented in biological brains: as a separate and unified network of attentional selection regions, receiving inputs from and exerting modulatory influence on the entire hierarchy of visual regions. Here, we report experiments with a simple such attention system that can improve the performance of standard convolutional networks, with relatively few additional parameters. Each spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Visual Attention and Saliency Detection · Advanced Neural Network Applications
