DO-Conv: Depthwise Over-parameterized Convolutional Layer
Jinming Cao, Yangyan Li, Mingchao Sun, Ying Chen, Dani Lischinski,, Daniel Cohen-Or, Baoquan Chen, Changhe Tu

TL;DR
DO-Conv introduces an over-parameterized convolutional layer that enhances CNN performance on vision tasks without increasing inference complexity by combining a depthwise convolution with a standard convolution.
Contribution
The paper proposes DO-Conv, a novel over-parameterized convolutional layer that improves CNN accuracy while maintaining inference efficiency through layer folding.
Findings
Boosts CNN performance on classification, detection, and segmentation tasks.
Increases training accuracy without additional inference cost.
Open-source implementation available in major deep learning frameworks.
Abstract
Convolutional layers are the core building blocks of Convolutional Neural Networks (CNNs). In this paper, we propose to augment a convolutional layer with an additional depthwise convolution, where each input channel is convolved with a different 2D kernel. The composition of the two convolutions constitutes an over-parameterization, since it adds learnable parameters, while the resulting linear operation can be expressed by a single convolution layer. We refer to this depthwise over-parameterized convolutional layer as DO-Conv. We show with extensive experiments that the mere replacement of conventional convolutional layers with DO-Conv layers boosts the performance of CNNs on many classical vision tasks, such as image classification, detection, and segmentation. Moreover, in the inference phase, the depthwise convolution is folded into the conventional convolution, reducing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsConvolution · Depthwise Convolution
