Natural gradient descent for improving variational inference based classification of radio galaxies
Devina Mohan, Anna M. M. Scaife

TL;DR
This paper investigates the use of natural gradient descent, specifically the iVON algorithm, to enhance variational inference in Bayesian neural networks for radio galaxy classification, leading to better uncertainty calibration and out-of-distribution detection.
Contribution
It demonstrates that natural gradient descent improves uncertainty calibration and out-of-distribution detection in BNNs for radio galaxy classification, highlighting the importance of optimizer choice.
Findings
iVON improves uncertainty calibration over previous methods.
Models trained with iVON can distinguish out-of-distribution data.
The cold posterior effect persists with iVON training.
Abstract
Bayesian neural networks (BNNs) are most commonly optimised with first-order optimisers such as stochastic gradient descent. However, when optimising for parameters of probabilistic models, incorporating second order information during optimisation can lead to a more direct path in the distribution space and faster convergence. In this work we examine whether using natural gradient descent can improve the performance of variational inference based classification of radio galaxies. We use the Improved Variational Online Newton (iVON) algorithm and compare its performance against a recent benchmark for BNNs for radio galaxy classification. We find that iVON results in better uncertainty calibration out of all the methods previously considered while providing similar predictive performance to the best performing inference methods such as Hamiltonian Monte Carlo and Bayes by Backprop based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
