The Generalized Reparameterization Gradient

Francisco J. R. Ruiz; Michalis K. Titsias; David M. Blei

arXiv:1610.02287·stat.ML·October 20, 2016·NeurIPS·27 cites

The Generalized Reparameterization Gradient

Francisco J. R. Ruiz, Michalis K. Titsias, David M. Blei

PDF

Open Access

TL;DR

The paper introduces the generalized reparameterization gradient, extending the reparameterization method to a broader class of distributions, enabling low-variance Monte Carlo gradients for variational inference.

Contribution

It proposes a novel generalized reparameterization technique that applies to more distributions, combining reparameterization and score function gradients for improved variational inference.

Findings

01

Effective with a single sample for low-variance gradients

02

Applicable to complex probabilistic models

03

Extends reparameterization to non-Gaussian distributions

Abstract

The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to optimize the variational objective. However, this technique does not easily apply to commonly used distributions such as beta or gamma without further approximations, and most practical applications of the reparameterization gradient fit Gaussian distributions. In this paper, we introduce the generalized reparameterization gradient, a method that extends the reparameterization gradient to a wider class of variational distributions. Generalized reparameterizations use invertible transformations of the latent variables which lead to transformed distributions that weakly depend on the variational parameters. This results in new Monte Carlo gradients that combine reparameterization gradients and score function gradients. We demonstrate our approach on variational inference for two complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models · Generative Adversarial Networks and Image Synthesis