Stopping Criteria in Contrastive Divergence: Alternatives to the   Reconstruction Error

David Buchaca; Enrique Romero; Ferran Mazzanti; Jordi Delgado

arXiv:1312.6062·cs.LG·April 10, 2014·ICLR·5 cites

Stopping Criteria in Contrastive Divergence: Alternatives to the Reconstruction Error

David Buchaca, Enrique Romero, Ferran Mazzanti, Jordi Delgado

PDF

Open Access

TL;DR

This paper explores alternative stopping criteria to the reconstruction error for training Restricted Boltzmann Machines with Contrastive Divergence, aiming to better detect improvements in the model's log-likelihood during learning.

Contribution

It introduces and evaluates simple alternative metrics to the reconstruction error for more effective stopping decisions in RBM training.

Findings

01

Alternative criteria can better indicate log-likelihood improvements

02

Reconstruction error may not reliably signal training progress

03

Proposed methods improve training efficiency

Abstract

Restricted Boltzmann Machines (RBMs) are general unsupervised learning devices to ascertain generative models of data distributions. RBMs are often trained using the Contrastive Divergence learning algorithm (CD), an approximation to the gradient of the data log-likelihood. A simple reconstruction error is often used to decide whether the approximation provided by the CD algorithm is good enough, though several authors (Schulz et al., 2010; Fischer & Igel, 2010) have raised doubts concerning the feasibility of this procedure. However, not many alternatives to the reconstruction error have been used in the literature. In this manuscript we investigate simple alternatives to the reconstruction error in order to detect as soon as possible the decrease in the log-likelihood during learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Lattice Boltzmann Simulation Studies