The Information Autoencoding Family: A Lagrangian Perspective on Latent   Variable Generative Models

Shengjia Zhao; Jiaming Song; Stefano Ermon

arXiv:1806.06514·stat.ML·July 10, 2018·37 cites

The Information Autoencoding Family: A Lagrangian Perspective on Latent Variable Generative Models

Shengjia Zhao, Jiaming Song, Stefano Ermon

PDF

Open Access 2 Repos

TL;DR

This paper unifies various latent variable generative model objectives through a Lagrangian duality framework, analyzing their trade-offs and proposing a dual optimization method for Pareto optimal solutions.

Contribution

It reveals that many existing objectives are dual functions of a single primal problem and introduces a dual optimization approach to improve model training.

Findings

01

Unified understanding of multiple objectives via Lagrangian duality

02

Characterization of statistical and computational trade-offs

03

Proposed dual optimization method achieves Pareto optimality

Abstract

A large number of objectives have been proposed to train latent variable generative models. We show that many of them are Lagrangian dual functions of the same primal optimization problem. The primal problem optimizes the mutual information between latent and visible variables, subject to the constraints of accurately modeling the data distribution and performing correct amortized inference. By choosing to maximize or minimize mutual information, and choosing different Lagrange multipliers, we obtain different objectives including InfoGAN, ALI/BiGAN, ALICE, CycleGAN, beta-VAE, adversarial autoencoders, AVB, AS-VAE and InfoVAE. Based on this observation, we provide an exhaustive characterization of the statistical and computational trade-offs made by all the training objectives in this class of Lagrangian duals. Next, we propose a dual optimization method where we optimize model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computational and Text Analysis Methods · Topic Modeling

MethodsBatch Normalization · Residual Connection · PatchGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Tanh Activation · Residual Block · Instance Normalization · Convolution · HuMan(Expedia)||How do I get a human at Expedia? · Sigmoid Activation