Modeling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting

Robin Th\'eriault; Francesco Tosello; Daniele Tantari

arXiv:2410.16150·cs.LG·May 21, 2025

Modeling Structured Data Learning with Restricted Boltzmann Machines in the Teacher-Student Setting

Robin Th\'eriault, Francesco Tosello, Daniele Tantari

PDF

Open Access

TL;DR

This paper investigates how Restricted Boltzmann Machines (RBMs) learn structured data in a teacher-student framework, revealing how data complexity, correlations, and regularization influence learning performance and generalization.

Contribution

It extends the teacher-student RBM framework to arbitrary hidden units, analyzes the impact of data structure and regularization, and links findings to the lottery ticket hypothesis.

Findings

01

Performance is independent of teacher patterns and student hidden units without correlations.

02

Critical data amount decreases with more teacher patterns and correlations.

03

Low inference temperature impedes learning teacher patterns.

Abstract

Restricted Boltzmann machines (RBM) are generative models capable to learn data with a rich underlying structure. We study the teacher-student setting where a student RBM learns structured data generated by a teacher RBM. The amount of structure in the data is controlled by adjusting the number of hidden units of the teacher and the correlations in the rows of the weights, a.k.a. patterns. In the absence of correlations, we validate the conjecture that the performance is independent of the number of teacher patters and hidden units of the student RBMs, and we argue that the teacher-student setting can be used as a toy model for studying the lottery ticket hypothesis. Beyond this regime, we find that the critical amount of data required to learn the teacher patterns decreases with both their number and correlations. In both regimes, we find that, even with a relatively large dataset, it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Generative Adversarial Networks and Image Synthesis