Pattern reconstruction with restricted Boltzmann machines

Giuseppe Genovese

arXiv:2205.07087·math.PR·October 31, 2023

Pattern reconstruction with restricted Boltzmann machines

Giuseppe Genovese

PDF

Open Access

TL;DR

This paper analyzes how the tail behavior of the hidden layer prior in restricted Boltzmann machines influences their ability to reconstruct patterns, revealing that tail properties critically determine retrieval performance.

Contribution

It provides a theoretical characterization of the energy landscape and shows how the tail behavior of the hidden prior affects pattern reconstruction capabilities.

Findings

01

Super-Gaussian tails lead to logarithmic loss in retrieval

02

Sub-Gaussian tails significantly hinder pattern retrieval

03

Gaussian tails' retrieval depends on the number of hidden units

Abstract

Restricted Boltzmann machines are energy models made of a visible and a hidden layer. We identify an effective energy function describing the zero-temperature landscape on the visible units and depending only on the tail behaviour of the hidden layer prior distribution. Studying the location of the local minima of such an energy function, we show that the ability of a restricted Boltzmann machine to reconstruct a random pattern depends indeed only on the tail of the hidden prior distribution. We find that hidden priors with strictly super-Gaussian tails give only a logarithmic loss in pattern retrieval, while an efficient retrieval is much harder with hidden units with strictly sub-Gaussian tails; if the hidden prior has Gaussian tails, the retrieval capability is determined by the number of hidden units (as in the Hopfield model).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications · Theoretical and Computational Physics

MethodsRestricted Boltzmann Machine