From Tail Universality to Bernstein-von Mises: A Unified Statistical Theory of Semi-Implicit Variational Inference
Sean Plummer

TL;DR
This paper develops a comprehensive statistical theory for semi-implicit variational inference (SIVI), analyzing its approximation capabilities, optimization properties, and conditions for successful posterior recovery.
Contribution
It provides the first unified theoretical framework for SIVI, detailing approximation limits, obstructions, and convergence guarantees under various conditions.
Findings
Semi-implicit families are dense in L1 under certain conditions.
Obstructions like tail-mismatch and structural restrictions affect approximation.
Empirical SIVI objectives converge to their population limits, ensuring consistency.
Abstract
Semi-implicit variational inference (SIVI) constructs approximate posteriors of the form , where the conditional kernel is parameterized and the mixing base is fixed and tractable. This paper develops a unified "approximation-optimization-statistics'' theory for such families. On the approximation side, we show that under compact L1-universality and a mild tail-dominance condition, semi-implicit families are dense in L1 and can achieve arbitrarily small forward Kullback-Leibler (KL) error. We also identify two sharp obstructions to global approximation: (i) an Orlicz tail-mismatch condition that induces a strictly positive forward-KL gap, and (ii) structural restrictions, such as non-autoregressive Gaussian kernels, that force "branch collapse'' in conditional distributions. For each obstruction we give a minimal structural modification that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques
