Least squares variational inference
Yvann Le Fay, Nicolas Chopin, Simon Barthelm\'e

TL;DR
This paper introduces LSVI, a gradient-free variational inference method based on fixed-point equations for exponential family approximations, with convergence guarantees and improved efficiency for Gaussian cases.
Contribution
The paper proposes LSVI, a novel Monte Carlo variational inference algorithm that is gradient-free, based on fixed-point equations, and provides convergence analysis and efficiency improvements.
Findings
LSVI outperforms existing methods in various examples.
LSVI is gradient-free, avoiding gradient computations.
Efficiency varies with Gaussian approximation complexity.
Abstract
Variational inference consists in finding the best approximation of a target distribution within a certain family, where `best' means (typically) smallest Kullback-Leiber divergence. We show that, when the approximation family is exponential, the best approximation is the solution of a fixed-point equation. We introduce LSVI (Least-Squares Variational Inference), a Monte Carlo variant of the corresponding fixed-point recursion, where each iteration boils down to ordinary least squares regression and does not require computing gradients. We show that LSVI is equivalent to stochastic mirror descent; we use this insight to derive convergence guarantees. We introduce various ideas to improve LSVI further when the approximation family is Gaussian, leading to a complexity in the dimension of the target in the full-covariance case, and a complexity in the mean-field case.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
