On the geometry of Stein variational gradient descent
A. Duncan, N. Nuesken, L. Szpruch

TL;DR
This paper explores the geometric structure of Stein variational gradient descent, analyzing its convergence and kernel choices, and demonstrates improved performance with novel kernel functions in high-dimensional Bayesian inference.
Contribution
It provides a geometric perspective on SVGD, investigates kernel selection, and introduces nondifferentiable kernels with adjusted tails for better convergence.
Findings
Nondifferentiable kernels with adjusted tails improve convergence.
Geometric analysis clarifies the role of kernel choice in SVGD.
Numerical experiments show significant performance gains.
Abstract
Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely on iterated steepest descent steps with respect to a reproducing kernel Hilbert space norm. This construction leads to interacting particle systems, the mean-field limit of which is a gradient flow on the space of probability distributions equipped with a certain geometrical structure. We leverage this viewpoint to shed some light on the convergence properties of the algorithm, in particular addressing the problem of choosing a suitable positive definite kernel function. Our analysis leads us to considering certain nondifferentiable kernels with adjusted tails. We demonstrate significant performance gains of these in various numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Point processes and geometric inequalities
