A Kernel Approach for Semi-implicit Variational Inference
Longlin Yu, Ziheng Cheng, Shiyue Zhang, Cheng Zhang

TL;DR
This paper introduces KSIVI, a kernel-based semi-implicit variational inference method that simplifies optimization, improves expressiveness, and demonstrates strong empirical performance in Bayesian inference tasks.
Contribution
We propose KSIVI, a tractable kernel method for semi-implicit variational inference that avoids lower-level optimization and enhances expressiveness with hierarchical extensions.
Findings
KSIVI reduces optimization complexity via explicit solutions in RKHS.
The method achieves effective inference on synthetic and real-world tasks.
Theoretical guarantees include variance bounds and generalization bounds.
Abstract
Semi-implicit variational inference (SIVI) enhances the expressiveness of variational families through hierarchical semi-implicit distributions, but the intractability of their densities makes standard ELBO-based optimization biased. Recent score-matching approaches to SIVI (SIVI-SM) address this issue via a minimax formulation, at the expense of an additional lower-level optimization problem. In this paper, we propose kernel semi-implicit variational inference (KSIVI), a principled and tractable alternative that eliminates the lower-level optimization by leveraging kernel methods. We show that when optimizing over a reproducing kernel Hilbert space, the lower-level problem admits an explicit solution, reducing the objective to the kernel Stein discrepancy (KSD). Exploiting the hierarchical structure of semi-implicit distributions, the resulting KSD objective can be efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
