Learning a Restricted Boltzmann Machine using biased Monte Carlo sampling
Nicolas B\'ereux, Aur\'elien Decelle, Cyril Furtlehner, Beatriz Seoane

TL;DR
This paper introduces a biased Monte Carlo sampling method, Tethered Monte Carlo, to accelerate sampling and training of Restricted Boltzmann Machines, especially on low-dimensional clustered datasets, improving efficiency and model quality.
Contribution
The paper demonstrates that Tethered Monte Carlo can significantly speed up RBM training and sampling by overcoming phase coexistence issues, a novel application in this context.
Findings
TMC accelerates sampling in RBMs with clustered datasets.
TMC improves the computation of log-likelihood gradients during training.
TMC enables recovery of the RBM free-energy profile.
Abstract
Restricted Boltzmann Machines are simple and powerful generative models that can encode any complex dataset. Despite all their advantages, in practice the trainings are often unstable and it is difficult to assess their quality because the dynamics are affected by extremely slow time dependencies. This situation becomes critical when dealing with low-dimensional clustered datasets, where the time required to sample ergodically the trained models becomes computationally prohibitive. In this work, we show that this divergence of Monte Carlo mixing times is related to a phenomenon of phase coexistence, similar to that which occurs in physics near a first-order phase transition. We show that sampling the equilibrium distribution using the Markov chain Monte Carlo method can be dramatically accelerated when using biased sampling techniques, in particular the Tethered Monte Carlo (TMC)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Lattice Boltzmann Simulation Studies · Domain Adaptation and Few-Shot Learning
