Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities
Lizeth Gonzalez-Carabarin, Iris A.M. Huijben, Bastiaan S. Veeling,, Alexandre Schmid, Ruud J.G. van Sloun

TL;DR
This paper introduces Dynamic Probabilistic Pruning (DPP), a flexible pruning framework that enables efficient, hardware-friendly neural network compression at various granularities while maintaining high accuracy and allowing joint optimization with quantization.
Contribution
DPP is a novel differentiable pruning method that supports multiple granularities and integrates with quantization, improving compression efficiency and hardware implementation flexibility.
Findings
Achieves competitive compression rates and accuracy on image classification benchmarks.
Enables joint optimization of pruning and weight quantization.
Provides new metrics for analyzing pruning confidence and diversity.
Abstract
Unstructured neural network pruning algorithms have achieved impressive compression rates. However, the resulting - typically irregular - sparse matrices hamper efficient hardware implementations, leading to additional memory usage and complex control logic that diminishes the benefits of unstructured pruning. This has spurred structured coarse-grained pruning solutions that prune entire filters or even layers, enabling efficient implementation at the expense of reduced flexibility. Here we propose a flexible new pruning mechanism that facilitates pruning at different granularities (weights, kernels, filters/feature maps), while retaining efficient memory organization (e.g. pruning exactly k-out-of-n weights for every output neuron, or pruning exactly k-out-of-n kernels for every feature map). We refer to this algorithm as Dynamic Probabilistic Pruning (DPP). DPP leverages the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Machine Learning and Data Classification
MethodsPruning
