Stochastic Backpropagation through Mixture Density Distributions
Alex Graves

TL;DR
This paper introduces a novel method for backpropagating stochastic gradients through mixture density distributions, enabling more effective training of models with mixture-distributed latent variables.
Contribution
It presents an alternative transform for mixture models that allows unbiased gradient estimation, extending the reparameterization trick to mixture densities.
Findings
Enables training of variational autoencoders with mixture-distributed latent variables
Provides an unbiased estimator for mixture density weight derivatives
Facilitates stochastic variational inference with mixture density posteriors
Abstract
The ability to backpropagate stochastic gradients through continuous latent distributions has been crucial to the emergence of variational autoencoders and stochastic gradient variational Bayes. The key ingredient is an unbiased and low-variance way of estimating gradients with respect to distribution parameters from gradients evaluated at distribution samples. The "reparameterization trick" provides a class of transforms yielding such estimators for many continuous distributions, including the Gaussian and other members of the location-scale family. However the trick does not readily extend to mixture density models, due to the difficulty of reparameterizing the discrete distribution over mixture weights. This report describes an alternative transform, applicable to any continuous multivariate distribution with a differentiable density function from which samples can be drawn, and uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Bayesian Methods and Mixture Models
