Rethinking the Smaller-Norm-Less-Informative Assumption in Channel   Pruning of Convolution Layers

Jianbo Ye; Xin Lu; Zhe Lin; James Z. Wang

arXiv:1802.00124·cs.LG·February 6, 2018·153 cites

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

Jianbo Ye, Xin Lu, Zhe Lin, James Z. Wang

PDF

Open Access 3 Repos

TL;DR

This paper introduces a novel channel pruning method for CNNs that does not rely on the assumption that smaller-norm parameters are less informative, focusing instead on directly simplifying the network's computation graph.

Contribution

It proposes an end-to-end stochastic training approach to identify and prune constant channels without high-dimensional tensor sparsity, improving model efficiency.

Findings

01

Achieves competitive performance on image benchmarks.

02

Reduces computational complexity without relying on norm-based assumptions.

03

Provides a mathematically sound and reproducible pruning method.

Abstract

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios. A widely-used practice in relevant work assumes that a smaller-norm parameter or feature plays a less informative role at the inference time. In this paper, we propose a channel pruning technique for accelerating the computations of deep convolutional neural networks (CNNs) that does not critically rely on this assumption. Instead, it focuses on direct simplification of the channel-to-channel computation graph of a CNN without the need of performing a computationally difficult and not-always-useful task of making high-dimensional tensors of CNN structured sparse. Our approach takes two stages: first to adopt an end-to- end stochastic training method that eventually forces the outputs of some channels to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsPruning