Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection

Mao Ye, Chengyue Gong, Lizhen Nie, Denny Zhou, Adam Klivans, and Qiang Liu

arXiv:2003.01794·cs.LG·October 20, 2020·37 cites



TL;DR

This paper introduces a greedy forward selection method for neural network pruning, shows theoretically that it finds good subnetworks in sufficiently large pre-trained networks, and demonstrates its practical success on ImageNet architectures.

Contribution

It proposes a simple greedy forward selection algorithm for pruning, with theoretical guarantees and practical improvements over existing methods.

Findings

01

On sufficiently large pre-trained networks, greedy forward selection provably finds small subnetworks with low loss.

02

The method outperforms prior pruning techniques on ImageNet architectures.

03

Pruned subnetworks benefit from fine-tuning rather than re-initialization.

Abstract

Recent empirical works show that large deep neural networks are often highly redundant and one can find much smaller subnetworks without a significant drop of accuracy. However, most existing methods of network pruning are empirical and heuristic, leaving it open whether good subnetworks provably exist, how to find them efficiently, and if network pruning can be provably better than direct training using gradient descent. We answer these problems positively by proposing a simple greedy selection approach for finding good subnetworks, which starts from an empty network and greedily adds important neurons from the large network. This differs from the existing methods based on backward elimination, which remove redundant neurons from the large network. Theoretically, applying the greedy selection strategy on sufficiently large pre-trained networks guarantees to find small subnetworks…
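As a rough illustration of the forward-selection idea described in the abstract, the NumPy sketch below starts from an empty subnetwork and, at each step, adds the neuron from the large network that most reduces the loss. It is a toy sketch under stated assumptions, not the authors' implementation: it assumes a single hidden layer whose output is the average of the selected neurons' outputs, a squared-error loss, and selection with repetition allowed. The names `greedy_forward_selection`, `neuron_outputs`, and `targets` are hypothetical and do not come from the paper or its repository.

```python
import numpy as np


def greedy_forward_selection(neuron_outputs, targets, n_select):
    """Toy greedy forward selection over the neurons of one hidden layer.

    neuron_outputs: (N, m) array, output of each of N neurons on m examples.
    targets:        (m,) array of regression targets.
    n_select:       number of greedy steps (neurons may be picked repeatedly,
                    which is an assumption of this sketch).
    Returns (selected_indices, loss_after_each_step).
    """
    neuron_outputs = np.asarray(neuron_outputs, dtype=float)
    targets = np.asarray(targets, dtype=float)

    selected = []
    losses = []
    # Sum of the outputs of the neurons selected so far (empty network = 0).
    running_sum = np.zeros(targets.shape, dtype=float)

    for step in range(1, n_select + 1):
        # Prediction of every candidate subnetwork obtained by adding one
        # more neuron: the average of the selected outputs plus the candidate.
        candidate_preds = (running_sum[None, :] + neuron_outputs) / step
        candidate_losses = np.mean(
            (candidate_preds - targets[None, :]) ** 2, axis=1
        )
        # Greedy choice: the neuron whose addition gives the smallest loss.
        best = int(np.argmin(candidate_losses))
        selected.append(best)
        losses.append(float(candidate_losses[best]))
        running_sum += neuron_outputs[best]

    return selected, losses


# Small usage example on synthetic data (illustrative only).
rng = np.random.default_rng(0)
demo_outputs = rng.normal(size=(200, 50))   # 200 neurons, 50 examples
demo_targets = demo_outputs.mean(axis=0)    # output of the full "large" network
demo_selected, demo_losses = greedy_forward_selection(
    demo_outputs, demo_targets, n_select=10
)
print(demo_selected)
print(demo_losses)
```

This contrasts with backward elimination, which would start from all neurons and remove them one at a time. Keeping a running sum of the selected outputs makes each greedy step a single vectorized pass over the candidates.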


Code & Models

Repositories

lushleaf/Network-Pruning-Greedy-Forward-Selection
PyTorch · Official

Videos

Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection · SlidesLive

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

Methods: Pruning · Average Pooling · Global Average Pooling · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Max Pooling · Kaiming Initialization · Residual Connection