Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference

Geoffrey Roeder; Yuhuai Wu; David Duvenaud

arXiv:1703.09194 · stat.ML · May 30, 2017 · 73 cites

TL;DR

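The paper proposes a simple variant of the standard reparameterized gradient estimator for the variational evidence lower bound (ELBO). Dropping the score function term from the total derivative leaves the estimator unbiased, and its variance approaches zero as the approximate posterior approaches the exact posterior. The behavior is analyzed both theoretically and empirically.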

Contribution

A gradient estimator that omits the score function term of the total derivative with respect to the variational parameters, together with generalizations to more complex variational distributions such as mixtures and importance-weighted posteriors.
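The decomposition behind the removed score function term can be sketched as follows. The notation is an assumption made here for illustration, not quoted from this page: $z = t(\epsilon, \phi)$ is a reparameterized sample from $q_\phi(z \mid x)$, and $\hat{\nabla}_{\mathrm{TD}}$ is the single-sample total derivative of the ELBO integrand.

```latex
\hat{\nabla}_{\mathrm{TD}}(\epsilon, \phi)
  = \underbrace{\nabla_z \bigl[ \log p(z \mid x) - \log q_\phi(z \mid x) \bigr]
      \, \nabla_\phi t(\epsilon, \phi)}_{\text{path derivative}}
  \; - \;
    \underbrace{\nabla_\phi \log q_\phi(z \mid x) \big|_{z \text{ fixed}}}_{\text{score function}},
\qquad
\mathbb{E}_{q_\phi(z \mid x)} \bigl[ \nabla_\phi \log q_\phi(z \mid x) \bigr] = 0 .
```

Because the score function has zero expectation, dropping it does not change the expected gradient. When $q_\phi(z \mid x) = p(z \mid x)$, the bracketed difference in the path derivative is zero for every $z$, so the remaining estimator is exactly zero sample by sample.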

Findings

01 Variance approaches zero as the approximate posterior approaches the exact posterior

02 Unbiased, with lower variance than the standard reparameterized estimator near the exact posterior

03 Generalizes to mixture and importance-weighted variational posteriors

Abstract

We propose a simple and general variant of the standard reparameterized gradient estimator for the variational evidence lower bound. Specifically, we remove a part of the total derivative with respect to the variational parameters that corresponds to the score function. Removing this term produces an unbiased gradient estimator whose variance approaches zero as the approximate posterior approaches the exact posterior. We analyze the behavior of this gradient estimator theoretically and empirically, and generalize it to more complex variational distributions such as mixtures and importance-weighted posteriors.
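The claim in the abstract can be checked on a toy problem. The sketch below is not the authors' code. It assumes a one-dimensional exact posterior N(mu_p, sigma_p^2) and a Gaussian approximation q = N(m, s^2), with gradients with respect to the mean m written out by hand. The names `sample_gradients` and `gradient_stats` are hypothetical helpers introduced for illustration.

```python
import random


def sample_gradients(m, s, mu_p, sigma_p, rng):
    """Single-sample ELBO gradients w.r.t. the variational mean m.

    Toy assumption: exact posterior N(mu_p, sigma_p^2), approximation N(m, s^2).
    Returns (total, path): the standard total-derivative estimate and the
    estimate with the score function term removed.
    """
    eps = rng.gauss(0.0, 1.0)
    z = m + s * eps                        # reparameterized sample, dz/dm = 1
    dlogp_dz = -(z - mu_p) / sigma_p ** 2  # d/dz log p(z | x)
    dlogq_dz = -eps / s                    # d/dz log q(z), using z - m = s * eps
    path = dlogp_dz - dlogq_dz             # path derivative (times dz/dm = 1)
    score = eps / s                        # d/dm log q(z) with z held fixed
    total = path - score                   # standard reparameterized estimate
    return total, path


def gradient_stats(m, s, mu_p, sigma_p, n=20000, seed=0):
    """Monte Carlo mean and variance of both estimators over n samples."""
    rng = random.Random(seed)
    samples = [sample_gradients(m, s, mu_p, sigma_p, rng) for _ in range(n)]
    stats = {}
    for name, idx in (("total", 0), ("path", 1)):
        values = [pair[idx] for pair in samples]
        mean = sum(values) / n
        stats[name + "_mean"] = mean
        stats[name + "_var"] = sum((v - mean) ** 2 for v in values) / n
    return stats


if __name__ == "__main__":
    # Approximation equal to the exact posterior: the path estimate is ~0 per sample.
    print(gradient_stats(m=1.0, s=0.5, mu_p=1.0, sigma_p=0.5))
    # Approximation away from the posterior: both estimators share the same mean.
    print(gradient_stats(m=0.0, s=1.0, mu_p=1.0, sigma_p=0.5))
```

In this toy setting the exact gradient with respect to m is -(m - mu_p) / sigma_p^2, so both estimators should average to that value, while only the path estimate collapses to zero variance when q matches the posterior. The variance comparison away from the posterior is specific to this example and should not be read as a general guarantee.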

Code & Models

Repositories

geoffroeder/iwae (Official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Statistical Methods and Inference