Omni-Dimensional Dynamic Convolution
Chao Li, Aojun Zhou, Anbang Yao

TL;DR
This paper introduces Omni-dimensional Dynamic Convolution (ODConv), a novel approach that applies multi-dimensional attention across all four kernel dimensions in CNNs, leading to significant accuracy improvements with fewer parameters.
Contribution
The paper proposes a generalized dynamic convolution method that learns attention across all four kernel dimensions simultaneously, enhancing CNN performance and efficiency.
Findings
ODConv improves top-1 accuracy by up to 5.71% on ImageNet.
ODConv with a single kernel outperforms existing multi-kernel dynamic convolutions.
ODConv surpasses other attention modules in modulating CNN features.
Abstract
Learning a single static convolutional kernel in each convolutional layer is the common training paradigm of modern Convolutional Neural Networks (CNNs). Instead, recent research in dynamic convolution shows that learning a linear combination of convolutional kernels weighted with their input-dependent attentions can significantly improve the accuracy of light-weight CNNs, while maintaining efficient inference. However, we observe that existing works endow convolutional kernels with the dynamic property through one dimension (regarding the convolutional kernel number) of the kernel space, but the other three dimensions (regarding the spatial size, the input channel number and the output channel number for each convolutional kernel) are overlooked. Inspired by this, we present Omni-dimensional Dynamic Convolution (ODConv), a more generalized yet elegant dynamic convolution design, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsConvolution
