Isotropic SGD: a Practical Approach to Bayesian Posterior Sampling

Giulio Franzese; Rosa Candela; Dimitrios Milios; Maurizio Filippone,; Pietro Michiardi

arXiv:2006.05087·cs.LG·June 11, 2020·1 cites

Isotropic SGD: a Practical Approach to Bayesian Posterior Sampling

Giulio Franzese, Rosa Candela, Dimitrios Milios, Maurizio Filippone,, Pietro Michiardi

PDF

Open Access

TL;DR

This paper introduces a practical method for Bayesian posterior sampling using isotropic stochastic gradient noise, improving upon existing algorithms by requiring weaker assumptions and maintaining competitive performance.

Contribution

It proposes a novel approach to make SG noise isotropic with a fixed learning rate, simplifying and enhancing the practicality of SG-MCMC algorithms.

Findings

01

Competitive performance with state-of-the-art SG-MCMC methods

02

Requires weaker assumptions than existing algorithms

03

More practical to implement and tune

Abstract

In this work we define a unified mathematical framework to deepen our understanding of the role of stochastic gradient (SG) noise on the behavior of Markov chain Monte Carlo sampling (SGMCMC) algorithms. Our formulation unlocks the design of a novel, practical approach to posterior sampling, which makes the SG noise isotropic using a fixed learning rate that we determine analytically, and that requires weaker assumptions than existing algorithms. In contrast, the common traits of existing \sgmcmc algorithms is to approximate the isotropy condition either by drowning the gradients in additive noise (annealing the learning rate) or by making restrictive assumptions on the \sg noise covariance and the geometry of the loss landscape. Extensive experimental validations indicate that our proposal is competitive with the state-of-the-art on \sgmcmc, while being much more practical to use.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models