Stein Variational Gradient Descent with Multiple Kernel
Qingzhong Ai, Shiyu Liu, Lirong He, Zenglin Xu

TL;DR
This paper introduces a novel multiple kernel approach for Stein Variational Gradient Descent (SVGD), enhancing its flexibility and performance by automatically combining kernels and extending kernel discrepancy measures.
Contribution
It extends Kernelized Stein Discrepancy to multiple kernels and develops MK-SVGD, which adaptively combines kernels without extra parameters, improving inference accuracy.
Findings
MK-SVGD outperforms traditional SVGD methods.
Automatically assigns kernel weights without additional parameters.
Consistently matches or exceeds competing methods in experiments.
Abstract
Stein variational gradient descent (SVGD) and its variants have shown promising successes in approximate inference for complex distributions. In practice, we notice that the kernel used in SVGD-based methods has a decisive effect on the empirical performance. Radial basis function (RBF) kernel with median heuristics is a common choice in previous approaches, but unfortunately this has proven to be sub-optimal. Inspired by the paradigm of Multiple Kernel Learning (MKL), our solution to this flaw is using a combination of multiple kernels to approximate the optimal kernel, rather than a single one which may limit the performance and flexibility. Specifically, we first extend Kernelized Stein Discrepancy (KSD) to its multiple kernels view called Multiple Kernelized Stein Discrepancy (MKSD) and then leverage MKSD to construct a general algorithm Multiple Kernel SVGD (MK-SVGD). Further,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and ELM
