Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky; Artyom Gadetsky; Denis Rakitin; Danil Karpushkin; Dmitry Vetrov

arXiv:2110.15072·cs.LG·October 29, 2021

TL;DR

This paper extends the Gumbel-Max trick to structured latent variables through a family of recursive algorithms, and trains them with score function estimators, which give unbiased gradients in combinatorial spaces without relying on biased differentiable surrogates.

Contribution

The paper presents a recursive Gumbel-Max approach that provides unbiased gradient estimates for structured latent variables, avoiding the extra model constraints and gradient bias that differentiable surrogates introduce.
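The unbiasedness mentioned here follows from the standard score function (REINFORCE) identity. The derivation below is a generic sketch, not the paper's own notation: it assumes a finite discrete domain, a distribution p_θ(x) with parameters θ, and an objective f(x) that does not depend on θ. The constant baseline b is a hypothetical symbol added to illustrate a simple control variate.

```latex
\begin{align}
\nabla_\theta \, \mathbb{E}_{x \sim p_\theta}\left[ f(x) \right]
  &= \sum_x f(x) \, \nabla_\theta p_\theta(x) \\
  &= \sum_x f(x) \, p_\theta(x) \, \nabla_\theta \log p_\theta(x) \\
  &= \mathbb{E}_{x \sim p_\theta}\left[ f(x) \, \nabla_\theta \log p_\theta(x) \right] \\
  &= \mathbb{E}_{x \sim p_\theta}\left[ \left( f(x) - b \right) \nabla_\theta \log p_\theta(x) \right],
\end{align}
% The last step uses E[ \nabla_\theta \log p_\theta(x) ] = \nabla_\theta \sum_x p_\theta(x) = 0,
% so subtracting any constant b leaves the expectation unchanged.
```

Averaging the bracketed term over samples from p_θ therefore gives an unbiased Monte Carlo estimate of the gradient, and the choice of b affects only its variance.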

Findings

01

Achieves results competitive with relaxation-based methods

02

Introduces a family of recursive algorithms that share a stochastic invariant

03

Provides reliable gradient estimates without additional constraints

Abstract

Structured latent variables allow incorporating meaningful prior knowledge into deep learning models. However, learning with such variables remains challenging because of their discrete nature. Nowadays, the standard learning approach is to define a latent variable as a perturbed algorithm output and to use a differentiable surrogate for training. In general, the surrogate puts additional constraints on the model and inevitably leads to biased gradients. To alleviate these shortcomings, we extend the Gumbel-Max trick to define distributions over structured domains. We avoid the differentiable surrogates by leveraging the score function estimators for optimization. In particular, we highlight a family of recursive algorithms with a common feature we call stochastic invariant. The feature allows us to construct reliable gradient estimates and control variates without additional…
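As a minimal sketch of the two ingredients the abstract names, the Python below shows the basic (non-recursive) Gumbel-Max trick for a plain categorical variable and a score function gradient estimate checked against the exact gradient. It is not the paper's recursive algorithm or its structured setting. The function names, the toy objective `f_values`, and the constant `baseline` are all assumptions made for illustration.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = np.exp(logits - logits.max())
    return z / z.sum()


def gumbel_max_sample(logits, rng, size):
    """Draw `size` categorical samples as argmax of Gumbel-perturbed logits."""
    gumbels = rng.gumbel(size=(size, logits.shape[0]))
    return np.argmax(logits + gumbels, axis=1)


def score_function_gradient(logits, f_values, samples, baseline=0.0):
    """Monte Carlo estimate of d/d(logits) E[f(x)] via (f(x) - b) * grad log p(x).

    `baseline` is a constant control variate (an illustrative assumption);
    any constant keeps the estimate unbiased.
    """
    probs = softmax(logits)
    one_hot = np.eye(logits.shape[0])[samples]   # shape (n, K)
    grad_log_p = one_hot - probs                 # gradient of log-softmax
    weights = f_values[samples] - baseline       # shape (n,)
    return (weights[:, None] * grad_log_p).mean(axis=0)


def exact_gradient(logits, f_values):
    """Closed-form gradient of sum_i p_i f_i with respect to the logits."""
    probs = softmax(logits)
    return probs * (f_values - probs @ f_values)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    logits = np.array([1.0, 0.0, -1.0])
    f_values = np.array([1.0, 2.0, 4.0])         # toy objective per category
    samples = gumbel_max_sample(logits, rng, 100_000)
    print("empirical frequencies:", np.bincount(samples, minlength=3) / samples.size)
    print("softmax probabilities:", softmax(logits))
    print("estimated gradient:   ", score_function_gradient(logits, f_values, samples, baseline=2.0))
    print("exact gradient:       ", exact_gradient(logits, f_values))
```

In this toy case the exact gradient is available because the domain has only three elements. In a combinatorial domain it is not, which is where a sampling-based estimator of this kind becomes useful.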

Code & Models

Repositories

RakitinDen/pytorch-recursive-gumbel-max-trick
PyTorch · Official

Videos

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces · SlidesLive

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications