On the role of non-linear latent features in bipartite generative neural networks

Tony Bonnaire; Giovanni Catania; Aur\'elien Decelle; Beatriz Seoane

arXiv:2506.10552·cond-mat.dis-nn·December 3, 2025

On the role of non-linear latent features in bipartite generative neural networks

Tony Bonnaire, Giovanni Catania, Aur\'elien Decelle, Beatriz Seoane

PDF

TL;DR

This paper analyzes how the choice of hidden unit priors and architectural modifications in bipartite energy-based neural networks, specifically RBMs, affect their phase diagram and memory retrieval capabilities, revealing ways to improve their associative memory performance.

Contribution

It provides a theoretical analysis linking hidden unit priors to the thermodynamic properties of RBMs and proposes modifications to enhance their memory retrieval abilities.

Findings

01

Binary RBMs have limited critical capacity.

02

Introducing biases and richer priors improves retrieval performance.

03

Theoretical results are supported by Monte Carlo simulations.

Abstract

We investigate the phase diagram and memory retrieval capabilities of bipartite energy-based neural networks, namely Restricted Boltzmann Machines (RBMs), as a function of the prior distribution imposed on their hidden units - including binary, multi-state, and ReLU-like activations. Drawing connections to the Hopfield model and employing analytical tools from statistical physics of disordered systems, we explore how the architectural choices and activation functions shape the thermodynamic properties of these models. Our analysis reveals that standard RBMs with binary hidden nodes and extensive connectivity suffer from reduced critical capacity, limiting their effectiveness as associative memories. To address this, we examine several modifications, such as introducing local biases and adopting richer hidden unit priors. These adjustments restore ordered retrieval phases and markedly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.