Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks
Yihui He, Jianing Qian, Jianren Wang, Cindy X. Le, Congrui Hetang, Qi, Lyu, Wenping Wang, Tianwei Yue

TL;DR
This paper introduces a novel SVD-based depth-wise decomposition method to accelerate separable convolutions in CNNs, maintaining high accuracy while reducing inference latency, demonstrated on ShuffleNet V2 and ImageNet.
Contribution
It proposes a new decomposition technique using GSVD to expand regular convolutions into efficient depthwise separable convolutions with minimal accuracy loss.
Findings
Outperforms channel decomposition on all tested datasets.
Improves ShuffleNet V2 Top-1 accuracy by approximately 2%.
Effective on both synthetic and large-scale datasets.
Abstract
Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks. However, most state-of-the-art CNNs are large, which results in high inference latency. Recently, depth-wise separable convolution has been proposed for image recognition tasks on computationally limited platforms such as robotics and self-driving cars. Though it is much faster than its counterpart, regular convolution, accuracy is sacrificed. In this paper, we propose a novel decomposition approach based on SVD, namely depth-wise decomposition, for expanding regular convolutions into depthwise separable convolutions while maintaining high accuracy. We show our approach can be further generalized to the multi-channel and multi-layer cases, based on Generalized Singular Value Decomposition (GSVD) [59]. We conduct thorough experiments with the latest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsGrouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Depthwise Convolution · Pointwise Convolution · Residual Connection · Average Pooling · Channel Shuffle · Groupwise Point Convolution · Global Average Pooling
