The equivalence between Stein variational gradient descent and black-box   variational inference

Casey Chu; Kentaro Minami; Kenji Fukumizu

arXiv:2004.01822·cs.LG·April 7, 2020·1 cites

The equivalence between Stein variational gradient descent and black-box variational inference

Casey Chu, Kentaro Minami, Kenji Fukumizu

PDF

Open Access

TL;DR

This paper demonstrates the formal equivalence between Stein variational gradient descent (SVGD) and black-box variational inference (BBVI) under certain kernel conditions, unifying various inference and generative modeling techniques.

Contribution

It establishes a precise connection between SVGD and BBVI using the neural tangent kernel, and interprets both as kernel gradient flows, providing a unified theoretical framework.

Findings

01

BBVI corresponds exactly to SVGD with the neural tangent kernel

02

Both methods can be viewed as kernel gradient flows in probability space

03

Kernel gradient flow dynamics relate to GAN training processes

Abstract

We formalize an equivalence between two popular methods for Bayesian inference: Stein variational gradient descent (SVGD) and black-box variational inference (BBVI). In particular, we show that BBVI corresponds precisely to SVGD when the kernel is the neural tangent kernel. Furthermore, we interpret SVGD and BBVI as kernel gradient flows; we do this by leveraging the recent perspective that views SVGD as a gradient flow in the space of probability distributions and showing that BBVI naturally motivates a Riemannian structure on that space. We observe that kernel gradient flow also describes dynamics found in the training of generative adversarial networks (GANs). This work thereby unifies several existing techniques in variational inference and generative modeling and identifies the kernel as a fundamental object governing the behavior of these algorithms, motivating deeper analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks