The effect of priors on Learning with Restricted Boltzmann Machines

Gianluca Manzan; Daniele Tantari

arXiv:2412.02623·cond-mat.dis-nn·February 2, 2026·2 cites

The effect of priors on Learning with Restricted Boltzmann Machines

Gianluca Manzan, Daniele Tantari

PDF

Open Access

TL;DR

This paper investigates how different priors in Restricted Boltzmann Machines affect learning efficiency and generalization, revealing a critical dataset size and how priors influence training and signal retrieval.

Contribution

It introduces a parametric class of priors interpolating between Gaussian and binary variables, analyzing their impact on RBM learning in teacher-student setups.

Findings

01

Existence of a critical dataset size for learning

02

Prudent prior choices expand the signal retrieval region

03

Critical size depends on teacher properties, not student priors

Abstract

Restricted Boltzmann Machines (RBMs) are generative models designed to learn from data with a rich underlying structure. In this work, we explore a teacher-student setting where a student RBM learns from examples generated by a teacher RBM, with a focus on the effect of the unit priors on learning efficiency. We consider a parametric class of priors that interpolate between continuous (Gaussian) and binary variables. This approach models various possible choices of visible units, hidden units, and weights for both the teacher and student RBMs. By analyzing the phase diagram of the posterior distribution in both the Bayes optimal and mismatched regimes, we demonstrate the existence of a triple point that defines the critical dataset size necessary for learning through generalization. The critical size is strongly influenced by the properties of the teacher, and thus the data, but is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM

MethodsFocus