Neural Epitome Search for Architecture-Agnostic Network Compression
Daquan Zhou, Xiaojie Jin, Qibin Hou, Kaixin Wang, Jianchao Yang,, Jiashi Feng

TL;DR
This paper introduces an auto-sampling method for CNN compression that learns sampling rules end-to-end, outperforming previous fixed-strategy methods like WSNet and achieving better accuracy and efficiency on 1D and 2D CNNs.
Contribution
The proposed auto-sampling approach automatically learns sampling strategies for network compression, enhancing performance over handcrafted methods and integrating differentiable rule learning.
Findings
Outperforms WSNet by 6.5% at the same compression ratio on 1D CNNs.
Achieves 1.47% higher accuracy than MobileNetV2 with 25% FLOPs reduction on ImageNet.
Surpasses some NAS-based methods like AMC and MnasNet in accuracy at similar compression levels.
Abstract
The recent WSNet [1] is a new model compression method through sampling filterweights from a compact set and has demonstrated to be effective for 1D convolutionneural networks (CNNs). However, the weights sampling strategy of WSNet ishandcrafted and fixed which may severely limit the expression ability of the resultedCNNs and weaken its compression ability. In this work, we present a novel auto-sampling method that is applicable to both 1D and 2D CNNs with significantperformance improvement over WSNet. Specifically, our proposed auto-samplingmethod learns the sampling rules end-to-end instead of being independent of thenetwork architecture design. With such differentiable weight sampling rule learning,the sampling stride and channel selection from the compact set are optimized toachieve better trade-off between model compression rate and performance. Wedemonstrate that at the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsTanh Activation · Sigmoid Activation · *Communicated@Fast*How Do I Communicate to Expedia? · Depthwise Convolution · Pointwise Convolution · Convolution · Average Pooling · Squeeze-and-Excitation Block · Global Average Pooling · Depthwise Separable Convolution
