Depth-wise Decomposition for Accelerating Separable Convolutions in   Efficient Convolutional Neural Networks

Yihui He; Jianing Qian; Jianren Wang; Cindy X. Le; Congrui Hetang; Qi; Lyu; Wenping Wang; Tianwei Yue

arXiv:1910.09455·cs.CV·January 7, 2025·5 cites

Depth-wise Decomposition for Accelerating Separable Convolutions in Efficient Convolutional Neural Networks

Yihui He, Jianing Qian, Jianren Wang, Cindy X. Le, Congrui Hetang, Qi, Lyu, Wenping Wang, Tianwei Yue

PDF

Open Access

TL;DR

This paper introduces a novel SVD-based depth-wise decomposition method to accelerate separable convolutions in CNNs, maintaining high accuracy while reducing inference latency, demonstrated on ShuffleNet V2 and ImageNet.

Contribution

It proposes a new decomposition technique using GSVD to expand regular convolutions into efficient depthwise separable convolutions with minimal accuracy loss.

Findings

01

Outperforms channel decomposition on all tested datasets.

02

Improves ShuffleNet V2 Top-1 accuracy by approximately 2%.

03

Effective on both synthetic and large-scale datasets.

Abstract

Very deep convolutional neural networks (CNNs) have been firmly established as the primary methods for many computer vision tasks. However, most state-of-the-art CNNs are large, which results in high inference latency. Recently, depth-wise separable convolution has been proposed for image recognition tasks on computationally limited platforms such as robotics and self-driving cars. Though it is much faster than its counterpart, regular convolution, accuracy is sacrificed. In this paper, we propose a novel decomposition approach based on SVD, namely depth-wise decomposition, for expanding regular convolutions into depthwise separable convolutions while maintaining high accuracy. We show our approach can be further generalized to the multi-channel and multi-layer cases, based on Generalized Singular Value Decomposition (GSVD) [59]. We conduct thorough experiments with the latest…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsGrouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Depthwise Convolution · Pointwise Convolution · Residual Connection · Average Pooling · Channel Shuffle · Groupwise Point Convolution · Global Average Pooling