Learning a Restricted Boltzmann Machine using biased Monte Carlo   sampling

Nicolas B\'ereux; Aur\'elien Decelle; Cyril Furtlehner; Beatriz Seoane

arXiv:2206.01310·cs.LG·March 16, 2023·1 cites

Learning a Restricted Boltzmann Machine using biased Monte Carlo sampling

Nicolas B\'ereux, Aur\'elien Decelle, Cyril Furtlehner, Beatriz Seoane

PDF

Open Access 1 Repo

TL;DR

This paper introduces a biased Monte Carlo sampling method, Tethered Monte Carlo, to accelerate sampling and training of Restricted Boltzmann Machines, especially on low-dimensional clustered datasets, improving efficiency and model quality.

Contribution

The paper demonstrates that Tethered Monte Carlo can significantly speed up RBM training and sampling by overcoming phase coexistence issues, a novel application in this context.

Findings

01

TMC accelerates sampling in RBMs with clustered datasets.

02

TMC improves the computation of log-likelihood gradients during training.

03

TMC enables recovery of the RBM free-energy profile.

Abstract

Restricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice the trainings are often unstable and it is difficult to assess their quality because the dynamics are affected by extremely slow time dependencies. This situation becomes critical when dealing with low-dimensional clustered datasets, where the time required to sample ergodically the trained models becomes computationally prohibitive. In this work, we show that this divergence of Monte Carlo mixing times is related to a phenomenon of phase coexistence, similar to that which occurs in physics near a first-order phase transition. We show that sampling the equilibrium distribution using the Markov chain Monte Carlo method can be dramatically accelerated when using biased sampling techniques, in particular the Tethered Monte Carlo (TMC)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nbereux/tmcrbm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Lattice Boltzmann Simulation Studies · Domain Adaptation and Few-Shot Learning