Pruning CNN's with linear filter ensembles
Csan\'ad S\'andor, Szabolcs P\'avel, Lehel Csat\'o

TL;DR
This paper introduces a novel CNN pruning method based on filter importance norms derived from empirical loss changes, enabling significant parameter and FLOP reduction with minimal accuracy loss.
Contribution
The paper proposes a new filter importance measure based on empirical loss change and a sampling approach to create filter ensembles for effective pruning.
Findings
Removed 60% of parameters from ResNet on CIFAR-10
Achieved 64% FLOP reduction with less than 0.6% accuracy drop
Outperformed traditional norm-based pruning methods
Abstract
Despite the promising results of convolutional neural networks (CNNs), their application on devices with limited resources is still a big challenge; this is mainly due to the huge memory and computation requirements of the CNN. To counter the limitation imposed by the network size, we use pruning to reduce the network size and -- implicitly -- the number of floating point operations (FLOPs). Contrary to the filter norm method -- used in ``conventional`` network pruning -- based on the assumption that a smaller norm implies ``less importance'' to its associated component, we develop a novel filter importance norm that is based on the change in the empirical loss caused by the presence or removal of a component from the network architecture. Since there are too many individual possibilities for filter configuration, we repeatedly sample from these architectural components and measure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsPruning · Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling
