Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior

Tsai Hor Chan; Dora Yan Zhang; Guosheng Yin; Lequan Yu

arXiv:2505.18280·cs.LG·May 27, 2025

Feature Preserving Shrinkage on Bayesian Neural Networks via the R2D2 Prior

Tsai Hor Chan, Dora Yan Zhang, Guosheng Yin, Lequan Yu

PDF

Open Access

TL;DR

This paper introduces R2D2-Net, a Bayesian neural network with a novel prior that effectively shrinks irrelevant weights while preserving important features, improving uncertainty estimation and predictive accuracy.

Contribution

The paper proposes the R2D2 prior for BNNs and a variational Gibbs inference algorithm, enhancing weight shrinkage control and posterior approximation accuracy.

Findings

01

Effective shrinkage of irrelevant coefficients.

02

Improved uncertainty estimation in image classification.

03

Theoretical analysis of posterior concentration rates.

Abstract

Bayesian neural networks (BNNs) treat neural network weights as random variables, which aim to provide posterior uncertainty estimates and avoid overfitting by performing inference on the posterior weights. However, the selection of appropriate prior distributions remains a challenging task, and BNNs may suffer from catastrophic inflated variance or poor predictive performance when poor choices are made for the priors. Existing BNN designs apply different priors to weights, while the behaviours of these priors make it difficult to sufficiently shrink noisy signals or they are prone to overshrinking important signals in the weights. To alleviate this problem, we propose a novel R2D2-Net, which imposes the R^2-induced Dirichlet Decomposition (R2D2) prior to the BNN weights. The R2D2-Net can effectively shrink irrelevant coefficients towards zero, while preventing key features from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Neural Networks and Applications · Anomaly Detection Techniques and Applications