Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
Hao Yu, Haoyu Chen, Yan Jiang, Wei Peng, Zhaodong Sun, Samuel Kaski, Guoying Zhao

TL;DR
This paper introduces Attentive Convolution (ATConv), a novel convolutional operator inspired by self-attention principles, achieving high efficiency and performance in vision tasks by integrating adaptive routing and lateral inhibition.
Contribution
The paper proposes ATConv, a new convolutional operator that incorporates self-attention principles, improving efficiency and accuracy over traditional convolutions and existing self-attention methods.
Findings
ATConv outperforms various self-attention mechanisms with only 3x3 kernels.
AttNet achieves 84.4% ImageNet-1K Top-1 accuracy with 27M parameters.
Replacing self-attention with ATConv reduces FID in diffusion models.
Abstract
Self-attention (SA) has become the cornerstone of modern vision backbones for its powerful expressivity over traditional Convolutions (Conv). However, its quadratic complexity remains a critical bottleneck for practical applications. Given that Conv offers linear complexity and strong visual priors, continuing efforts have been made to promote the renaissance of Conv. However, a persistent performance chasm remains, highlighting that these modernizations have not yet captured the intrinsic expressivity that defines SA. In this paper, we re-examine the design of the CNNs, directed by a key question: what principles give SA its edge over Conv? As a result, we reveal two fundamental insights that challenge the long-standing design intuitions in prior research (e.g., Receptive field). The two findings are: (1) \textit{Adaptive routing}: SA dynamically regulates positional information flow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
