Efficient Semi-Implicit Variational Inference

Vincent Moens; Hang Ren; Alexandre Maraval; Rasul Tutunov; Jun Wang,; Haitham Ammar

arXiv:2101.06070·cs.LG·January 18, 2021

Efficient Semi-Implicit Variational Inference

Vincent Moens, Hang Ren, Alexandre Maraval, Rasul Tutunov, Jun Wang,, Haitham Ammar

PDF

Open Access

TL;DR

This paper introduces CI-VI, a scalable and efficient solver for semi-implicit variational inference that handles bias in nonlinear expectations, with proven convergence and successful application to complex posteriors.

Contribution

We develop a novel optimizer for SIVI that manages bias in nonlinear nested expectations and demonstrate its convergence and effectiveness in complex models.

Findings

01

Converges to a stationary point with rate O(t^{-4/5})

02

Effectively approximates complex posteriors in NLP datasets

03

Handles bias in nonlinear nested expectations

Abstract

In this paper, we propose CI-VI an efficient and scalable solver for semi-implicit variational inference (SIVI). Our method, first, maps SIVI's evidence lower bound (ELBO) to a form involving a nonlinear functional nesting of expected values and then develops a rigorous optimiser capable of correctly handling bias inherent to nonlinear nested expectations using an extrapolation-smoothing mechanism coupled with gradient sketching. Our theoretical results demonstrate convergence to a stationary point of the ELBO in general non-convex settings typically arising when using deep network models and an order of $O (t^{- \frac{4}{5}})$ gradient-bias-vanishing rate. We believe these results generalise beyond the specific nesting arising from SIVI to other forms. Finally, in a set of experiments, we demonstrate the effectiveness of our algorithm in approximating complex posteriors on various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Gaussian Processes and Bayesian Inference

MethodsVariational Inference