Cascade of phase transitions in the training of Energy-based models
Dimitrios Bachtis, Giulio Biroli, Aur\'elien Decelle, Beatriz Seoane

TL;DR
This paper explores the training dynamics of energy-based models, specifically RBMs, revealing a cascade of phase transitions during learning that correspond to the progressive encoding of data modes, validated through analytical and empirical methods.
Contribution
The study uncovers a series of phase transitions in RBM training, linking them to the learning of principal data modes, and validates the findings with real dataset experiments and a mean-field scaling hypothesis.
Findings
Identifies a cascade of phase transitions during RBM training.
Shows phase transitions correspond to learning principal data modes.
Validates analytical results with experiments on real datasets.
Abstract
In this paper, we investigate the feature encoding process in a prototypical energy-based generative model, the Restricted Boltzmann Machine (RBM). We start with an analytical investigation using simplified architectures and data structures, and end with numerical analysis of real trainings on real datasets. Our study tracks the evolution of the model's weight matrix through its singular value decomposition, revealing a series of phase transitions associated to a progressive learning of the principal modes of the empirical probability distribution. The model first learns the center of mass of the modes and then progressively resolve all modes through a cascade of phase transitions. We first describe this process analytically in a controlled setup that allows us to study analytically the training dynamics. We then validate our theoretical results by training the Bernoulli-Bernoulli RBM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Lattice Boltzmann Simulation Studies
MethodsRestricted Boltzmann Machine
