Bayesian sparsification for deep neural networks with Bayesian model   reduction

Dimitrije Markovi\'c; Karl J. Friston; and Stefan J. Kiebel

arXiv:2309.12095·stat.ML·July 30, 2024

Bayesian sparsification for deep neural networks with Bayesian model reduction

Dimitrije Markovi\'c, Karl J. Friston, and Stefan J. Kiebel

PDF

Open Access 1 Repo

TL;DR

This paper introduces Bayesian model reduction as an efficient alternative for sparsifying deep neural networks, enabling post-hoc pruning of redundant weights with competitive performance across diverse architectures.

Contribution

It proposes Bayesian model reduction for neural network sparsification, reducing computational demands compared to traditional hierarchical prior methods.

Findings

01

BMR outperforms hierarchical horseshoe priors in pruning efficiency.

02

BMR is applicable across various architectures, including CNNs and transformers.

03

Post-hoc pruning with BMR maintains competitive performance.

Abstract

Deep learning's immense capabilities are often constrained by the complexity of its models, leading to an increasing demand for effective sparsification techniques. Bayesian sparsification for deep learning emerges as a crucial approach, facilitating the design of models that are both computationally efficient and competitive in terms of performance across various deep learning applications. The state-of-the-art -- in Bayesian sparsification of deep neural networks -- combines structural shrinkage priors on model weights with an approximate inference scheme based on stochastic variational inference. However, model inversion of the full generative model is exceptionally computationally demanding, especially when compared to standard deep learning of point estimates. In this context, we advocate for the use of Bayesian model reduction (BMR) as a more efficient alternative for pruning of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dimarkov/bmr4pml
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsVariational Inference · Pruning