REBAR: Low-variance, unbiased gradient estimates for discrete latent   variable models

George Tucker; Andriy Mnih; Chris J. Maddison; Dieterich Lawson,; Jascha Sohl-Dickstein

arXiv:1703.07370·cs.LG·November 7, 2017·167 cites

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson,, Jascha Sohl-Dickstein

PDF

Open Access 3 Repos

TL;DR

This paper introduces REBAR, a novel method that combines control variates and continuous relaxations to produce low-variance, unbiased gradient estimates for models with discrete latent variables, improving training efficiency.

Contribution

The paper presents a new control variate technique that yields unbiased, low-variance gradient estimates and an adaptive relaxation method, advancing discrete latent variable optimization.

Findings

01

Achieves state-of-the-art variance reduction on benchmark tasks.

02

Leads to faster convergence and better final log-likelihood.

03

Removes the need for hyperparameter tuning of relaxation tightness.

Abstract

Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al. 2016, Maddison et al. 2016) has taken a different approach, introducing a continuous relaxation of discrete variables to produce low-variance, but biased, gradient estimates. In this work, we combine the two approaches through a novel control variate that produces low-variance, \emph{unbiased} gradient estimates. Then, we introduce a modification to the continuous relaxation and show that the tightness of the relaxation can be adapted online, removing it as a hyperparameter. We show state-of-the-art variance reduction on several benchmark generative modeling tasks, generally leading to faster convergence to a better final log-likelihood.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Domain Adaptation and Few-Shot Learning

MethodsREINFORCE