DiffPrune: Neural Network Pruning with Deterministic Approximate Binary Gates and $L_0$ Regularization
Yaniv Shulman

TL;DR
DiffPrune introduces a differentiable approach for neural network pruning using deterministic approximate binary gates and $L_0$ regularization, enabling efficient model sparsity and flexible pruning strategies.
Contribution
It presents a novel deterministic approximation of Bernoulli variables and a method for model selection with binary gates, facilitating differentiable and flexible pruning.
Findings
Effective sparsity achieved with $L_0$ regularization.
Supports arbitrary group sparsity for flexible pruning.
Demonstrated success on neural network pruning tasks.
Abstract
Modern neural network architectures typically have many millions of parameters and can be pruned significantly without substantial loss in effectiveness which demonstrates they are over-parameterized. The contribution of this work is two-fold. The first is a method for approximating a multivariate Bernoulli random variable by means of a deterministic and differentiable transformation of any real-valued multivariate random variable. The second is a method for model selection by element-wise multiplication of parameters with approximate binary gates that may be computed deterministically or stochastically and take on exact zero values. Sparsity is encouraged by the inclusion of a surrogate regularization to the loss. Since the method is differentiable it enables straightforward and efficient learning of model architectures by an empirical risk minimization procedure with stochastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
MethodsBatch Normalization · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Convolution · Wide Residual Block · WideResNet
