Pruning Filters for Efficient ConvNets

Hao Li; Asim Kadav; Igor Durdanovic; Hanan Samet; Hans Peter Graf

arXiv:1608.08710·cs.CV·March 13, 2017·685 cites

Pruning Filters for Efficient ConvNets

Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, Hans Peter Graf

PDF

Open Access 5 Repos

TL;DR

This paper introduces a filter pruning method for CNNs that removes entire filters with minimal impact on accuracy, significantly reducing inference costs without requiring sparse convolution support.

Contribution

It proposes a simple filter pruning technique that accelerates CNNs by removing whole filters, avoiding irregular sparsity and enabling compatibility with existing dense matrix libraries.

Findings

01

Reduced inference costs for VGG-16 by up to 34%

02

Reduced inference costs for ResNet-110 by up to 38%

03

Achieved near-original accuracy after retraining

Abstract

The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various layers without hurting original accuracy. However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks. We present an acceleration method for CNNs, where we prune filters from CNNs that are identified as having a small effect on the output accuracy. By removing whole filters in the network together with their connecting feature maps, the computation costs are reduced significantly. In contrast to pruning weights, this approach does not result in sparse connectivity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsPruning · Convolution