Pruning CNN's with linear filter ensembles

Csan\'ad S\'andor; Szabolcs P\'avel; Lehel Csat\'o

arXiv:2001.08142·cs.LG·March 4, 2020·1 cites

Pruning CNN's with linear filter ensembles

Csan\'ad S\'andor, Szabolcs P\'avel, Lehel Csat\'o

PDF

Open Access

TL;DR

This paper introduces a novel CNN pruning method based on filter importance norms derived from empirical loss changes, enabling significant parameter and FLOP reduction with minimal accuracy loss.

Contribution

The paper proposes a new filter importance measure based on empirical loss change and a sampling approach to create filter ensembles for effective pruning.

Findings

01

Removed 60% of parameters from ResNet on CIFAR-10

02

Achieved 64% FLOP reduction with less than 0.6% accuracy drop

03

Outperformed traditional norm-based pruning methods

Abstract

Despite the promising results of convolutional neural networks (CNNs), their application on devices with limited resources is still a big challenge; this is mainly due to the huge memory and computation requirements of the CNN. To counter the limitation imposed by the network size, we use pruning to reduce the network size and -- implicitly -- the number of floating point operations (FLOPs). Contrary to the filter norm method -- used in ``conventional`` network pruning -- based on the assumption that a smaller norm implies ``less importance'' to its associated component, we develop a novel filter importance norm that is based on the change in the empirical loss caused by the presence or removal of a component from the network architecture. Since there are too many individual possibilities for filter configuration, we repeatedly sample from these architectural components and measure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsPruning · Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling