Omni-Dimensional Dynamic Convolution

Chao Li; Aojun Zhou; Anbang Yao

arXiv:2209.07947·cs.CV·September 19, 2022·173 cites

Omni-Dimensional Dynamic Convolution

Chao Li, Aojun Zhou, Anbang Yao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Omni-dimensional Dynamic Convolution (ODConv), a novel approach that applies multi-dimensional attention across all four kernel dimensions in CNNs, leading to significant accuracy improvements with fewer parameters.

Contribution

The paper proposes a generalized dynamic convolution method that learns attention across all four kernel dimensions simultaneously, enhancing CNN performance and efficiency.

Findings

01

ODConv improves top-1 accuracy by up to 5.71% on ImageNet.

02

ODConv with a single kernel outperforms existing multi-kernel dynamic convolutions.

03

ODConv surpasses other attention modules in modulating CNN features.

Abstract

Learning a single static convolutional kernel in each convolutional layer is the common training paradigm of modern Convolutional Neural Networks (CNNs). Instead, recent research in dynamic convolution shows that learning a linear combination of $n$ convolutional kernels weighted with their input-dependent attentions can significantly improve the accuracy of light-weight CNNs, while maintaining efficient inference. However, we observe that existing works endow convolutional kernels with the dynamic property through one dimension (regarding the convolutional kernel number) of the kernel space, but the other three dimensions (regarding the spatial size, the input channel number and the output channel number for each convolutional kernel) are overlooked. Inspired by this, we present Omni-dimensional Dynamic Convolution (ODConv), a more generalized yet elegant dynamic convolution design, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osvai/odconv
pytorchOfficial

Videos

Omni-Dimensional Dynamic Convolution· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI

MethodsConvolution