On the Convergence of Black-Box Variational Inference

Kyurae Kim; Jisu Oh; Kaiwen Wu; Yi-An Ma; Jacob R. Gardner

arXiv:2305.15349·cs.LG·January 12, 2024·2 cites

On the Convergence of Black-Box Variational Inference

Kyurae Kim, Jisu Oh, Kaiwen Wu, Yi-An Ma, Jacob R. Gardner

PDF

Open Access 1 Video

TL;DR

This paper proves the first convergence guarantees for full black-box variational inference (BBVI) without simplifying assumptions, showing that certain design choices affect convergence and that proximal stochastic gradient descent improves it.

Contribution

It provides the first convergence analysis for general BBVI, identifies suboptimal design choices, and demonstrates the effectiveness of proximal SGD in practice.

Findings

01

Proximal SGD achieves the strongest convergence guarantees for BBVI.

02

Certain nonlinear parameterizations can slow convergence.

03

Empirical results favor proximal SGD over standard BBVI implementations.

Abstract

We provide the first convergence guarantee for full black-box variational inference (BBVI), also known as Monte Carlo variational inference. While preliminary investigations worked on simplified versions of BBVI (e.g., bounded domain, bounded support, only optimizing for the scale, and such), our setup does not need any such algorithmic modifications. Our results hold for log-smooth posterior densities with and without strong log-concavity and the location-scale variational family. Also, our analysis reveals that certain algorithm design choices commonly employed in practice, particularly, nonlinear parameterizations of the scale of the variational approximation, can result in suboptimal convergence rates. Fortunately, running BBVI with proximal stochastic gradient descent fixes these limitations, and thus achieves the strongest known convergence rate guarantees. We evaluate this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On the Convergence of Black-Box Variational Inference· slideslive

Taxonomy

TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning

MethodsVariational Inference · Stochastic Gradient Descent