Robust Learning of Parsimonious Deep Neural Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris

TL;DR
This paper introduces a novel simultaneous learning and pruning algorithm for neural networks that reduces computational costs and maintains accuracy by effectively identifying and removing irrelevant structures during early training stages.
Contribution
It presents a variational inference-based pruning method with a new hyper-prior, ensuring deterministic final networks and robust pruning across different initializations and network sizes.
Findings
Achieves state-of-the-art structured pruning levels
Maintains higher test accuracy compared to existing methods
Demonstrates robustness to initialization and network size
Abstract
We propose a simultaneous learning and pruning algorithm capable of identifying and eliminating irrelevant structures in a neural network during the early stages of training. Thus, the computational cost of subsequent training iterations, besides that of inference, is considerably reduced. Our method, based on variational inference principles using Gaussian scale mixture priors on neural network weights, learns the variational posterior distribution of Bernoulli random variables multiplying the units/filters similarly to adaptive dropout. Our algorithm, ensures that the Bernoulli parameters practically converge to either 0 or 1, establishing a deterministic final network. We analytically derive a novel hyper-prior distribution over the prior parameters that is crucial for their optimal selection and leads to consistent pruning levels and prediction accuracy regardless of weight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsPruning · Variational Inference
