Categorical Reparameterization with Gumbel-Softmax
Eric Jang, Shixiang Gu, Ben Poole

TL;DR
This paper introduces the Gumbel-Softmax distribution, a differentiable approximation for categorical variables that enables backpropagation in neural networks, improving training efficiency and performance on various tasks.
Contribution
The paper proposes the Gumbel-Softmax distribution and a gradient estimator that allows differentiable sampling from categorical variables, facilitating neural network training.
Findings
Outperforms existing gradient estimators in structured output prediction.
Enables large speedups in semi-supervised classification.
Improves unsupervised generative modeling with categorical latent variables.
Abstract
Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with categorical latent variables, and enables large speedups on semi-supervised classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Computational Physics and Python Applications
MethodsGumbel Softmax
