Learning Latent Permutations with Gumbel-Sinkhorn Networks

Gonzalo Mena; David Belanger; Scott Linderman; Jasper Snoek

arXiv:1802.08665·stat.ML·February 26, 2018·93 cites

Learning Latent Permutations with Gumbel-Sinkhorn Networks

Gonzalo Mena, David Belanger, Scott Linderman, Jasper Snoek

PDF

Open Access 2 Repos

TL;DR

This paper introduces Gumbel-Sinkhorn networks, a novel approach for learning latent permutations using continuous relaxations, enabling end-to-end training for tasks involving matchings and permutations.

Contribution

It extends the Gumbel-Softmax method to distributions over permutations with the Sinkhorn operator, facilitating differentiable learning of latent matchings.

Findings

01

Outperforms baselines on sorting tasks

02

Effective in solving jigsaw puzzles

03

Identifies neural signals in worms

Abstract

Permutations and matchings are core building blocks in a variety of latent variable models, as they allow us to align, canonicalize, and sort data. Learning in such models is difficult, however, because exact marginalization over these combinatorial objects is intractable. In response, this paper introduces a collection of new methods for end-to-end learning in such models that approximate discrete maximum-weight matching using the continuous Sinkhorn operator. Sinkhorn iteration is attractive because it functions as a simple, easy-to-implement analog of the softmax operator. With this, we can define the Gumbel-Sinkhorn method, an extension of the Gumbel-Softmax method (Jang et al. 2016, Maddison2016 et al. 2016) to distributions over latent matchings. We demonstrate the effectiveness of our method by outperforming competitive baselines on a range of qualitatively different tasks:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Handwritten Text Recognition Techniques

MethodsSoftmax