Differentiable Channel Sparsity Search via Weight Sharing within Filters
Yu Zhao, Chung-Kuei Lee

TL;DR
This paper introduces DCSS, a differentiable method for automatically optimizing channel sparsity in CNNs, using weight sharing to reduce resource use and improve performance across various tasks.
Contribution
The paper presents a novel differentiable channel sparsity search method that automatically finds optimal sparsity configurations, with a new weight sharing technique to prevent shape mismatching.
Findings
DCSS achieves state-of-the-art results in image classification.
Task-specific search outperforms model transfer in segmentation and super resolution.
The method is efficient and widely applicable across tasks.
Abstract
In this paper, we propose the differentiable channel sparsity search (DCSS) for convolutional neural networks. Unlike traditional channel pruning algorithms which require users to manually set prune ratios for each convolutional layer, DCSS automatically searches the optimal combination of sparsities. Inspired by the differentiable architecture search (DARTS), we draw lessons from the continuous relaxation and leverage the gradient information to balance the computational cost and metrics. Since directly applying the scheme of DARTS causes shape mismatching and excessive memory consumption, we introduce a novel technique called weight sharing within filters. This technique elegantly eliminates the problem of shape mismatching with negligible additional resources. We conduct comprehensive experiments on not only image classification but also find-grained tasks including semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsPruning · Differentiable Architecture Search
