Deep Tempering
Guillaume Desjardins, Heng Luo, Aaron Courville, Yoshua Bengio

TL;DR
This paper introduces a novel sampling method called Deep Tempering, which uses hierarchical models and parallel tempering to improve mixing efficiency in RBM training, leading to better sampling quality.
Contribution
It proposes Deep Tempering, a new sampling approach leveraging deep hierarchical models and parallel tempering to enhance RBM training efficiency.
Findings
Improved mixing in Gibbs sampling for RBMs.
Enhanced sampling efficiency with Deep Tempering.
Experimental validation shows better training performance.
Abstract
Restricted Boltzmann Machines (RBMs) are one of the fundamental building blocks of deep learning. Approximate maximum likelihood training of RBMs typically necessitates sampling from these models. In many training scenarios, computationally efficient Gibbs sampling procedures are crippled by poor mixing. In this work we propose a novel method of sampling from Boltzmann machines that demonstrates a computationally efficient way to promote mixing. Our approach leverages an under-appreciated property of deep generative models such as the Deep Belief Network (DBN), where Gibbs sampling from deeper levels of the latent variable hierarchy results in dramatically increased ergodicity. Our approach is thus to train an auxiliary latent hierarchical model, based on the DBN. When used in conjunction with parallel-tempering, the method is asymptotically guaranteed to simulate samples from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Music and Audio Processing · Model Reduction and Neural Networks
MethodsDeep Belief Network
