Population-Contrastive-Divergence: Does Consistency help with RBM   training?

Oswin Krause; Asja Fischer; Christian Igel

arXiv:1510.01624·cs.LG·June 29, 2017

Population-Contrastive-Divergence: Does Consistency help with RBM training?

Oswin Krause, Asja Fischer, Christian Igel

PDF

TL;DR

This paper introduces Population-Contrastive-Divergence (pop-CD), a new RBM training algorithm that offers a consistent gradient estimate with lower bias, improving training accuracy over traditional Contrastive Divergence, especially on smaller models.

Contribution

The paper proposes pop-CD, a novel RBM training method based on Population Monte Carlo, providing a consistent gradient estimate with minimal additional computational cost.

Findings

01

pop-CD reduces bias compared to CD

02

pop-CD achieves higher log-likelihood values

03

pop-CD's variance increases, affecting large models

Abstract

Estimating the log-likelihood gradient with respect to the parameters of a Restricted Boltzmann Machine (RBM) typically requires sampling using Markov Chain Monte Carlo (MCMC) techniques. To save computation time, the Markov chains are only run for a small number of steps, which leads to a biased estimate. This bias can cause RBM training algorithms such as Contrastive Divergence (CD) learning to deteriorate. We adopt the idea behind Population Monte Carlo (PMC) methods to devise a new RBM training algorithm termed Population-Contrastive-Divergence (pop-CD). Compared to CD, it leads to a consistent estimate and may have a significantly lower bias. Its computational overhead is negligible compared to CD. However, the variance of the gradient estimate increases. We experimentally show that pop-CD can significantly outperform CD. In many cases, we observed a smaller bias and achieved…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.