Natural gradient descent for improving variational inference based classification of radio galaxies

Devina Mohan; Anna M. M. Scaife

arXiv:2511.13224·astro-ph.IM·November 18, 2025

Natural gradient descent for improving variational inference based classification of radio galaxies

Devina Mohan, Anna M. M. Scaife

PDF

Open Access

TL;DR

This paper investigates the use of natural gradient descent, specifically the iVON algorithm, to enhance variational inference in Bayesian neural networks for radio galaxy classification, leading to better uncertainty calibration and out-of-distribution detection.

Contribution

It demonstrates that natural gradient descent improves uncertainty calibration and out-of-distribution detection in BNNs for radio galaxy classification, highlighting the importance of optimizer choice.

Findings

01

iVON improves uncertainty calibration over previous methods.

02

Models trained with iVON can distinguish out-of-distribution data.

03

The cold posterior effect persists with iVON training.

Abstract

Bayesian neural networks (BNNs) are most commonly optimised with first-order optimisers such as stochastic gradient descent. However, when optimising for parameters of probabilistic models, incorporating second order information during optimisation can lead to a more direct path in the distribution space and faster convergence. In this work we examine whether using natural gradient descent can improve the performance of variational inference based classification of radio galaxies. We use the Improved Variational Online Newton (iVON) algorithm and compare its performance against a recent benchmark for BNNs for radio galaxy classification. We find that iVON results in better uncertainty calibration out of all the methods previously considered while providing similar predictive performance to the best performing inference methods such as Hamiltonian Monte Carlo and Bayes by Backprop based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning