Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel
SungYub Kim, Sihwan Park, Kyungsu Kim, Eunho Yang

TL;DR
This paper introduces a scale-invariant Bayesian neural network framework that improves generalization bounds and uncertainty calibration by decomposing parameters into scale and connectivity, addressing issues caused by parameter scaling.
Contribution
It proposes a novel prior and posterior distribution invariant to parameter scaling, enabling more accurate generalization bounds and uncertainty calibration for practical neural network transformations.
Findings
Invariant posterior improves flatness measures
Enhanced uncertainty calibration in Bayesian neural networks
Effective in practical parameter transformation scenarios
Abstract
Explaining generalizations and preventing over-confident predictions are central goals of studies on the loss landscape of neural networks. Flatness, defined as loss invariability on perturbations of a pre-trained solution, is widely accepted as a predictor of generalization in this context. However, the problem that flatness and generalization bounds can be changed arbitrarily according to the scale of a parameter was pointed out, and previous studies partially solved the problem with restrictions: Counter-intuitively, their generalization bounds were still variant for the function-preserving parameter scaling transformation or limited only to an impractical network structure. As a more fundamental solution, we propose new prior and posterior distributions invariant to scaling transformations by \textit{decomposing} the scale and connectivity of parameters, thereby allowing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Advanced Neural Network Applications
MethodsWeight Decay
