Adaptive Parallel Tempering for Stochastic Maximum Likelihood Learning   of RBMs

Guillaume Desjardins; Aaron Courville; Yoshua Bengio

arXiv:1012.3476·stat.ML·December 17, 2010·29 cites

Adaptive Parallel Tempering for Stochastic Maximum Likelihood Learning of RBMs

Guillaume Desjardins, Aaron Courville, Yoshua Bengio

PDF

Open Access

TL;DR

This paper introduces an adaptive method for Parallel Tempering in training RBMs, automating temperature selection and chain spawning to improve sampling efficiency and likelihood scores without extra computational cost.

Contribution

It proposes an automated temperature optimization and dynamic chain spawning technique for Parallel Tempering in RBM training, reducing manual tuning and computational overhead.

Findings

01

Improved likelihood scores on synthetic data

02

Automated temperature selection enhances sampling efficiency

03

Dynamic chain spawning reduces computational overhead

Abstract

Restricted Boltzmann Machines (RBM) have attracted a lot of attention of late, as one the principle building blocks of deep networks. Training RBMs remains problematic however, because of the intractibility of their partition function. The maximum likelihood gradient requires a very robust sampler which can accurately sample from the model despite the loss of ergodicity often incurred during learning. While using Parallel Tempering in the negative phase of Stochastic Maximum Likelihood (SML-PT) helps address the issue, it imposes a trade-off between computational complexity and high ergodicity, and requires careful hand-tuning of the temperatures. In this paper, we show that this trade-off is unnecessary. The choice of optimal temperatures can be automated by minimizing average return time (a concept first proposed by [Katzgraber et al., 2006]) while chains can be spawned dynamically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics