Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion
Mingbao Lin, Bohong Chen, Fei Chao, Rongrong Ji

TL;DR
This paper introduces a novel filter fusion method called DCFF for creating compact CNNs efficiently without relying on pretrained models or sparse regularization, achieving high accuracy with fewer computations.
Contribution
The paper proposes a dynamic-coded filter fusion technique that evaluates filter importance via a KL divergence-based criterion, enabling the creation of compact CNNs without pretrained models or sparse constraints.
Findings
Achieved 93.47% top-1 accuracy on CIFAR-10 with a compact VGGNet-16.
Reduced FLOPs by 63.8% and parameters by 58.6% in ResNet-50 on ImageNet.
Demonstrated superior performance over existing filter pruning methods.
Abstract
The mainstream approach for filter pruning is usually either to force a hard-coded importance estimation upon a computation-heavy pretrained model to select "important" filters, or to impose a hyperparameter-sensitive sparse constraint on the loss objective to regularize the network training. In this paper, we present a novel filter pruning method, dubbed dynamic-coded filter fusion (DCFF), to derive compact CNNs in a computation-economical and regularization-free manner for efficient image classification. Each filter in our DCFF is firstly given an inter-similarity distribution with a temperature parameter as a filter proxy, on top of which, a fresh Kullback-Leibler divergence based dynamic-coded criterion is proposed to evaluate the filter importance. In contrast to simply keeping high-score filters in other methods, we propose the concept of filter fusion, i.e., the weighted averages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods
MethodsPruning
