Pruning Convolutional Neural Networks for Resource Efficient Inference

Pavlo Molchanov; Stephen Tyree; Tero Karras; Timo Aila; Jan Kautz

arXiv:1611.06440·cs.LG·June 12, 2017·1.2k cites

Pruning Convolutional Neural Networks for Resource Efficient Inference

Pavlo Molchanov, Stephen Tyree, Tero Karras, Timo Aila, Jan Kautz

PDF

Open Access 5 Repos

TL;DR

This paper introduces a Taylor expansion-based criterion for pruning convolutional neural networks, enabling resource-efficient inference with minimal accuracy loss, especially effective in transfer learning and large-scale datasets.

Contribution

It presents a novel pruning criterion based on Taylor expansion, combined with greedy pruning and fine-tuning, improving efficiency in transfer learning scenarios.

Findings

01

Superior pruning performance over traditional criteria.

02

Achieved over 10x reduction in 3D-convolutional filters.

03

Effective on large-scale ImageNet dataset.

Abstract

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that maintains good generalization in the pruned network. We propose a new criterion based on Taylor expansion that approximates the change in the cost function induced by pruning network parameters. We focus on transfer learning, where large pretrained networks are adapted to specialized tasks. The proposed criterion demonstrates superior performance compared to other criteria, e.g. the norm of kernel weights or feature map activation, for pruning large CNNs after adaptation to fine-grained classification tasks (Birds-200 and Flowers-102) relaying only on the first order gradient information. We also show that pruning can lead to more than 10x theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition

MethodsPruning