Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel

SungYub Kim; Sihwan Park; Kyungsu Kim; Eunho Yang

arXiv:2209.15208·cs.LG·October 3, 2022

TL;DR

This paper introduces a scale-invariant Bayesian neural network framework that improves generalization bounds and uncertainty calibration by decomposing parameters into scale and connectivity, addressing issues caused by parameter scaling.

Contribution

The paper proposes new prior and posterior distributions that are invariant to parameter scaling, enabling more accurate generalization bounds and uncertainty calibration under practical neural network parameter transformations.

Findings

01. Invariant posterior improves flatness measures

02. Enhanced uncertainty calibration in Bayesian neural networks

03. Effective in practical parameter transformation scenarios

Abstract

Explaining generalization and preventing over-confident predictions are central goals of studies on the loss landscape of neural networks. Flatness, defined as the invariance of the loss under perturbations of a pre-trained solution, is widely accepted as a predictor of generalization in this context. However, it has been pointed out that flatness and generalization bounds can be changed arbitrarily by rescaling parameters, and previous studies solved this problem only partially and under restrictions: counter-intuitively, their generalization bounds either still varied under function-preserving parameter scaling transformations or applied only to impractical network structures. As a more fundamental solution, we propose new prior and posterior distributions that are invariant to scaling transformations by decomposing the scale and connectivity of parameters, thereby allowing the…
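The scale dependence of flatness described in the abstract can be illustrated with a small NumPy sketch. It is not the paper's method: the two-layer bias-free ReLU network, the toy data, the value of `alpha`, and the helper names `forward`, `loss`, `additive_sharpness`, and `relative_sharpness` are all assumptions made for illustration. The sketch rescales the two layers so that the function is unchanged, then compares the loss change under an additive perturbation with the loss change under a relative (multiplicative) perturbation of the weights.

```python
import numpy as np

rng = np.random.default_rng(0)


def forward(w1, w2, x):
    """Two-layer ReLU network without biases: f(x) = w2 @ relu(w1 @ x)."""
    return w2 @ np.maximum(w1 @ x, 0.0)


def loss(w1, w2, x, y):
    """Mean squared error over a batch (columns of x are examples)."""
    return float(np.mean((forward(w1, w2, x) - y) ** 2))


# Toy data and weights; every value here is hypothetical.
x = rng.normal(size=(4, 32))
y = rng.normal(size=(1, 32))
w1 = rng.normal(size=(8, 4))
w2 = rng.normal(size=(1, 8))

# Function-preserving rescaling: relu(a * z) = a * relu(z) for a > 0,
# so scaling layer 1 by alpha and layer 2 by 1/alpha leaves f unchanged.
alpha = 10.0
w1_s, w2_s = alpha * w1, w2 / alpha

# One fixed perturbation, reused for both parameterizations.
e1 = 0.05 * rng.normal(size=w1.shape)
e2 = 0.05 * rng.normal(size=w2.shape)


def additive_sharpness(a, b):
    """Loss change when the perturbation is added to the weights."""
    return loss(a + e1, b + e2, x, y) - loss(a, b, x, y)


def relative_sharpness(a, b):
    """Loss change when each weight is perturbed in proportion to itself."""
    return loss(a * (1.0 + e1), b * (1.0 + e2), x, y) - loss(a, b, x, y)


print("same function:", np.allclose(forward(w1, w2, x), forward(w1_s, w2_s, x)))
print("additive:", additive_sharpness(w1, w2), additive_sharpness(w1_s, w2_s))
print("relative:", relative_sharpness(w1, w2), relative_sharpness(w1_s, w2_s))
```

In this toy setting the additive measure differs between the two parameterizations even though they compute the same function, while the relative measure agrees up to floating-point error, because the rescaling factors cancel inside the multiplicative perturbation.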

Videos

Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel · SlidesLive

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Machine Learning and Algorithms · Advanced Neural Network Applications

Methods: Weight Decay