Bayesian sparsification for deep neural networks with Bayesian model reduction
Dimitrije Markovi\'c, Karl J. Friston, and Stefan J. Kiebel

TL;DR
This paper introduces Bayesian model reduction as an efficient alternative for sparsifying deep neural networks, enabling post-hoc pruning of redundant weights with competitive performance across diverse architectures.
Contribution
It proposes Bayesian model reduction for neural network sparsification, reducing computational demands compared to traditional hierarchical prior methods.
Findings
BMR outperforms hierarchical horseshoe priors in pruning efficiency.
BMR is applicable across various architectures, including CNNs and transformers.
Post-hoc pruning with BMR maintains competitive performance.
Abstract
Deep learning's immense capabilities are often constrained by the complexity of its models, leading to an increasing demand for effective sparsification techniques. Bayesian sparsification for deep learning emerges as a crucial approach, facilitating the design of models that are both computationally efficient and competitive in terms of performance across various deep learning applications. The state-of-the-art -- in Bayesian sparsification of deep neural networks -- combines structural shrinkage priors on model weights with an approximate inference scheme based on stochastic variational inference. However, model inversion of the full generative model is exceptionally computationally demanding, especially when compared to standard deep learning of point estimates. In this context, we advocate for the use of Bayesian model reduction (BMR) as a more efficient alternative for pruning of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsVariational Inference · Pruning
