Implicit Filter Sparsification In Convolutional Neural Networks

Dushyant Mehta; Kwang In Kim; Christian Theobalt

arXiv:1905.04967·cs.LG·May 14, 2019·1 cites

Implicit Filter Sparsification In Convolutional Neural Networks

Dushyant Mehta, Kwang In Kim, Christian Theobalt

PDF

Open Access

TL;DR

This paper investigates how implicit filter sparsity naturally occurs in CNNs with Batch Normalization and ReLU, revealing mechanisms that can inform filter pruning strategies.

Contribution

It provides an empirical analysis of implicit filter sparsification in CNNs, linking it to existing heuristics and suggesting new pruning approaches.

Findings

01

Implicit sparsity emerges in CNN filters during training.

02

Selective feature pruning contributes to sparsity.

03

Implications for filter pruning heuristics.

Abstract

We show implicit filter level sparsity manifests in convolutional neural networks (CNNs) which employ Batch Normalization and ReLU activation, and are trained with adaptive gradient descent techniques and L2 regularization or weight decay. Through an extensive empirical study (Mehta et al., 2019) we hypothesize the mechanism behind the sparsification process, and find surprising links to certain filter sparsification heuristics proposed in literature. Emergence of, and the subsequent pruning of selective features is observed to be one of the contributing mechanisms, leading to feature sparsity at par or better than certain explicit sparsification / pruning approaches. In this workshop article we summarize our findings, and point out corollaries of selective-featurepenalization which could also be employed as heuristics for filter pruning

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Image and Signal Denoising Methods

MethodsPruning · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization