Structured Sparsification with Joint Optimization of Group Convolution and Channel Shuffle
Xin-Yu Zhang, Kai Zhao, Taihong Xiao, Ming-Ming Cheng, and Ming-Hsuan, Yang

TL;DR
This paper introduces a structured sparsification method that automatically induces sparsity in convolutional weights and employs a learnable channel shuffle to efficiently compress CNNs with minimal performance loss.
Contribution
It presents a novel joint optimization approach for structured sparsity and channel shuffling, improving network compression efficiency and accuracy.
Findings
Achieves competitive accuracy with reduced computational complexity.
Effectively induces structured sparsity for efficient implementation.
Maintains performance with negligible accuracy drop across various architectures.
Abstract
Recent advances in convolutional neural networks(CNNs) usually come with the expense of excessive computational overhead and memory footprint. Network compression aims to alleviate this issue by training compact models with comparable performance. However, existing compression techniques either entail dedicated expert design or compromise with a moderate performance drop. In this paper, we propose a novel structured sparsification method for efficient network compression. The proposed method automatically induces structured sparsity on the convolutional weights, thereby facilitating the implementation of the compressed model with the highly-optimized group convolution. We further address the problem of inter-group communication with a learnable channel shuffle mechanism. The proposed approach can be easily applied to compress many network architectures with a negligible performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Speech and Audio Processing · Domain Adaptation and Few-Shot Learning
MethodsChannel Shuffle
