BMRS: Bayesian Model Reduction for Structured Pruning
Dustin Wright, Christian Igel, and Raghavendra Selvan

TL;DR
BMRS introduces a Bayesian framework for structured neural network pruning, enabling efficient model compression with high accuracy without extensive tuning, demonstrated across various datasets and architectures.
Contribution
The paper presents BMRS, a novel end-to-end Bayesian structured pruning method utilizing Bayesian model reduction for effective neural network compression.
Findings
BMRS achieves high compression rates with maintained accuracy.
Two variants of BMRS offer different trade-offs between compression and performance.
Experimental results show competitive performance compared to existing pruning methods.
Abstract
Modern neural networks are often massively overparameterized leading to high compute costs during training and at inference. One effective method to improve both the compute and energy efficiency of neural networks while maintaining good performance is structured pruning, where full network structures (e.g.~neurons or convolutional filters) that have limited impact on the model output are removed. In this work, we propose Bayesian Model Reduction for Structured pruning (BMRS), a fully end-to-end Bayesian method of structured pruning. BMRS is based on two recent methods: Bayesian structured pruning with multiplicative noise, and Bayesian model reduction (BMR), a method which allows efficient comparison of Bayesian models under a change in prior. We present two realizations of BMRS derived from different priors which yield different structured pruning characteristics: 1) BMRS_N with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSpeech Recognition and Synthesis · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting
MethodsPruning
