CondConv: Conditionally Parameterized Convolutions for Efficient Inference
Brandon Yang, Gabriel Bender, Quoc V. Le, Jiquan Ngiam

TL;DR
CondConv introduces conditionally parameterized convolutions that learn specialized kernels per example, enhancing neural network capacity and performance while maintaining efficient inference, demonstrated on ImageNet classification.
Contribution
This paper presents CondConv, a novel convolutional layer that learns per-example kernels, enabling larger, more capable networks without increased inference cost.
Findings
Improved accuracy on ImageNet with EfficientNet-B0 using CondConv.
Achieved 78.3% top-1 accuracy with only 413M multiply-adds.
Enhanced performance on detection tasks with CondConv.
Abstract
Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized convolutions (CondConv), which learn specialized convolutional kernels for each example. Replacing normal convolutions with CondConv enables us to increase the size and capacity of a network, while maintaining efficient inference. We demonstrate that scaling networks with CondConv improves the performance and inference cost trade-off of several existing convolutional neural network architectures on both classification and detection tasks. On ImageNet classification, our CondConv approach applied to EfficientNet-B0 achieves state-of-the-art performance of 78.3% accuracy with only 413M multiply-adds. Code and checkpoints for the CondConv Tensorflow layer and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsCosine Annealing · RMSProp · Tanh Activation · Residual Connection · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Depthwise Convolution · Pointwise Convolution · Non Maximum Suppression
