What Regularized Auto-Encoders Learn from the Data Generating   Distribution

Guillaume Alain; Yoshua Bengio

arXiv:1211.4246·cs.LG·August 20, 2014·ICLR·101 cites

What Regularized Auto-Encoders Learn from the Data Generating Distribution

Guillaume Alain, Yoshua Bengio

PDF

Open Access

TL;DR

This paper demonstrates that regularized auto-encoders learn the score function of the data distribution, revealing their ability to characterize the local shape of the data density and enabling sampling via MCMC.

Contribution

It provides a generic theoretical framework showing auto-encoders capture the score function, independent of parametrization, and links regularized auto-encoders to score matching and sampling methods.

Findings

01

Auto-encoders learn the score (gradient of log-density) of data.

02

Theoretical results are parametrization-independent.

03

Sampling experiments confirm the ability to generate data from learned distribution.

Abstract

What do auto-encoders learn about the underlying data generating distribution? Recent work suggests that some auto-encoder variants do a good job of capturing the local manifold structure of data. This paper clarifies some of these previous observations by showing that minimizing a particular form of regularized reconstruction error yields a reconstruction function that locally characterizes the shape of the data generating density. We show that the auto-encoder captures the score (derivative of the log-density with respect to the input). It contradicts previous interpretations of reconstruction error as an energy function. Unlike previous results, the theorems provided here are completely generic and do not depend on the parametrization of the auto-encoder: they show what the auto-encoder would tend to if given enough capacity and examples. These results are for a contractive training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Lattice Boltzmann Simulation Studies