GSNs : Generative Stochastic Networks
Guillaume Alain, Yoshua Bengio, Li Yao, Jason Yosinski, Eric, Thibodeau-Laufer, Saizheng Zhang, Pascal Vincent

TL;DR
This paper introduces Generative Stochastic Networks (GSNs), a new probabilistic modeling framework that learns transition operators of Markov chains, enabling easier training and sampling, validated through experiments on image datasets.
Contribution
The paper proposes GSNs as an alternative to maximum likelihood, providing a new training principle and theoretical justification for related models like denoising auto-encoders.
Findings
GSNs can effectively model data distributions with fewer modes.
They enable training with backpropagation without layerwise pretraining.
Experimental results validate the theoretical advantages on image datasets.
Abstract
We introduce a novel training principle for probabilistic models that is an alternative to maximum likelihood. The proposed Generative Stochastic Networks (GSN) framework is based on learning the transition operator of a Markov chain whose stationary distribution estimates the data distribution. Because the transition distribution is a conditional distribution generally involving a small move, it has fewer dominant modes, being unimodal in the limit of small moves. Thus, it is easier to learn, more like learning to perform supervised function approximation, with gradients that can be obtained by back-propagation. The theorems provided here generalize recent work on the probabilistic interpretation of denoising auto-encoders and provide an interesting justification for dependency networks and generalized pseudolikelihood (along with defining an appropriate joint distribution and sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Anomaly Detection Techniques and Applications
MethodsDeep Boltzmann Machine
