Structured Bayesian Pruning via Log-Normal Multiplicative Noise

Kirill Neklyudov; Dmitry Molchanov; Arsenii Ashukha; Dmitry Vetrov

arXiv:1705.07283·stat.ML·November 7, 2017·NeurIPS·68 cites

Structured Bayesian Pruning via Log-Normal Multiplicative Noise

Kirill Neklyudov, Dmitry Molchanov, Arsenii Ashukha, Dmitry Vetrov

PDF

Open Access 5 Repos

TL;DR

This paper introduces a Bayesian approach to structured neural network pruning using log-normal noise, enabling automatic removal of neurons or channels for faster inference while maintaining accuracy.

Contribution

It proposes a novel Bayesian model with structured sparsity that considers network architecture, using a truncated log-uniform prior and a closed-form variational approximation.

Findings

01

Achieves significant acceleration on various deep neural networks.

02

Automatically removes neurons or channels based on SNR.

03

Easy to implement as a dropout-like layer.

Abstract

Dropout-based regularization methods can be regarded as injecting random noise with pre-defined magnitude to different parts of the neural network during training. It was recently shown that Bayesian dropout procedure not only improves generalization but also leads to extremely sparse neural architectures by automatically setting the individual noise magnitude per weight. However, this sparsity can hardly be used for acceleration since it is unstructured. In the paper, we propose a new Bayesian model that takes into account the computational structure of neural networks and provides structured sparsity, e.g. removes neurons and/or convolutional channels in CNNs. To do this we inject noise to the neurons outputs while keeping the weights unregularized. We establish the probabilistic model with a proper truncated log-uniform prior over the noise and truncated log-normal variational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference

MethodsDropout