Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong, Zhang, Qifeng Chen

TL;DR
This paper introduces involution, a novel neural network operation that inverts convolution principles, unifies self-attention, and improves visual recognition performance across multiple benchmarks with reduced computational cost.
Contribution
The paper proposes involution, a new operation that replaces convolution by inverting its principles, and demonstrates its effectiveness and efficiency in various vision tasks.
Findings
Involution improves ResNet-50 top-1 accuracy by up to 1.6%.
Involution reduces computational cost to about 65-72% of convolution.
Involution unifies self-attention as a special case.
Abstract
Convolution has been the core ingredient of modern neural networks, triggering the surge of deep learning in vision. In this work, we rethink the inherent principles of standard convolution for vision tasks, specifically spatial-agnostic and channel-specific. Instead, we present a novel atomic operation for deep neural networks by inverting the aforementioned design principles of convolution, coined as involution. We additionally demystify the recent popular self-attention operator and subsume it into our involution family as an over-complicated instantiation. The proposed involution operator could be leveraged as fundamental bricks to build the new generation of neural networks for visual recognition, powering different deep learning models on several prevalent benchmarks, including ImageNet classification, COCO detection and segmentation, together with Cityscapes segmentation. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained)· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsInvolution · Convolution
