Deep Tempering

Guillaume Desjardins; Heng Luo; Aaron Courville; Yoshua Bengio

arXiv:1410.0123·cs.LG·October 2, 2014

Deep Tempering

Guillaume Desjardins, Heng Luo, Aaron Courville, Yoshua Bengio

PDF

Open Access

TL;DR

This paper introduces a novel sampling method called Deep Tempering, which uses hierarchical models and parallel tempering to improve mixing efficiency in RBM training, leading to better sampling quality.

Contribution

It proposes Deep Tempering, a new sampling approach leveraging deep hierarchical models and parallel tempering to enhance RBM training efficiency.

Findings

01

Improved mixing in Gibbs sampling for RBMs.

02

Enhanced sampling efficiency with Deep Tempering.

03

Experimental validation shows better training performance.

Abstract

Restricted Boltzmann Machines (RBMs) are one of the fundamental building blocks of deep learning. Approximate maximum likelihood training of RBMs typically necessitates sampling from these models. In many training scenarios, computationally efficient Gibbs sampling procedures are crippled by poor mixing. In this work we propose a novel method of sampling from Boltzmann machines that demonstrates a computationally efficient way to promote mixing. Our approach leverages an under-appreciated property of deep generative models such as the Deep Belief Network (DBN), where Gibbs sampling from deeper levels of the latent variable hierarchy results in dramatically increased ergodicity. Our approach is thus to train an auxiliary latent hierarchical model, based on the DBN. When used in conjunction with parallel-tempering, the method is asymptotically guaranteed to simulate samples from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Model Reduction and Neural Networks

MethodsDeep Belief Network