On the properties of variational approximations of Gibbs posteriors
Pierre Alquier, James Ridgway, Nicolas Chopin

TL;DR
This paper investigates the theoretical properties of variational approximations to Gibbs posteriors in PAC-Bayesian methods, showing they often retain the same convergence rates as the original intractable distributions, with practical applications demonstrated.
Contribution
It provides a general theoretical analysis of variational approximations to Gibbs posteriors, establishing their convergence properties and applicability across various learning tasks.
Findings
Variational approximations often match the convergence rate of Gibbs posteriors.
The approach is applicable to classification, ranking, and matrix completion.
Empirical results show good approximation properties on real datasets.
Abstract
The PAC-Bayesian approach is a powerful set of techniques to derive non- asymptotic risk bounds for random estimators. The corresponding optimal distribution of estimators, usually called the Gibbs posterior, is unfortunately intractable. One may sample from it using Markov chain Monte Carlo, but this is often too slow for big datasets. We consider instead variational approximations of the Gibbs posterior, which are fast to compute. We undertake a general study of the properties of such approximations. Our main finding is that such a variational approximation has often the same rate of convergence as the original PAC-Bayesian procedure it approximates. We specialise our results to several learning tasks (classification, ranking, matrix completion),discuss how to implement a variational approximation in each case, and illustrate the good properties of said approximation on real datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Statistical Methods and Inference
