Bayesian inference for the learning rate in Generalised Bayesian inference

Jeong Eun Lee; Sitong Liu; Geoff K. Nicholls

arXiv:2506.12532·stat.ME·May 18, 2026

Bayesian inference for the learning rate in Generalised Bayesian inference

Jeong Eun Lee, Sitong Liu, Geoff K. Nicholls

PDF

TL;DR

This paper introduces a Bayesian approach to estimate hyperparameters in Generalised Bayesian Inference, enabling joint uncertainty quantification and improved hyperparameter selection.

Contribution

It defines two new hyperparameter posteriors based on utility and pseudo-true parameters, supporting joint estimation and uncertainty quantification.

Findings

01

GBI-posteriors outperform standard Bayesian inference on simulated data.

02

The method effectively selects near-optimal hyperparameters in text analysis.

03

Supports combining multiple datasets with uncertainty quantification.

Abstract

In Generalised Bayesian Inference (GBI), the learning rate and hyperparameters of the loss must be estimated. These inference-hyperparameters can't be estimated jointly with the other parameters, from the data, by giving them a prior. However, in some settings there exist unknown ``true'' hyperparameter-values about which it is meaningful to have prior belief. It is then possible to use Bayesian inference with held-out data to get hyperparameter-posteriors. We define two hyperparameter posteriors, one based on an ELPPD-utility and one aiming to cover the pseudo-true parameter. The new framework supports estimation and uncertainty quantification for multiple hyperparameters jointly. Experiments show that the resulting GBI-posteriors out-perform Bayesian inference on simulated test data and select optimal or near optimal hyperparameter values in a large real problem of text analysis.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms