Holistic Filter Pruning for Efficient Deep Neural Networks
Lukas Enderich, Fabian Timm, Wolfram Burgard

TL;DR
Holistic Filter Pruning (HFP) is a new method for efficiently reducing the complexity of deep neural networks by globally allocating pruning resources to meet specific size targets, achieving state-of-the-art results.
Contribution
HFP introduces a simple, global pruning approach that allows precise control over model size and complexity during training, outperforming existing methods.
Findings
Prunes 60% of multiplications in ResNet-50 on ImageNet with minimal accuracy loss.
Achieves state-of-the-art pruning performance on CIFAR-10 and ImageNet.
Enables easy implementation and accurate control of pruning rates.
Abstract
Deep neural networks (DNNs) are usually over-parameterized to increase the likelihood of getting adequate initial weights by random initialization. Consequently, trained DNNs have many redundancies which can be pruned from the model to reduce complexity and improve the ability to generalize. Structural sparsity, as achieved by filter pruning, directly reduces the tensor sizes of weights and activations and is thus particularly effective for reducing complexity. We propose "Holistic Filter Pruning" (HFP), a novel approach for common DNN training that is easy to implement and enables to specify accurate pruning rates for the number of both parameters and multiplications. After each forward pass, the current model complexity is calculated and compared to the desired target size. By gradient descent, a global solution can be found that allocates the pruning budget over the individual layers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
