Dense Hebbian neural networks: a replica symmetric picture of   unsupervised learning

Elena Agliari; Linda Albanese; Francesco Alemanno; Andrea; Alessandrelli; Adriano Barra; Fosca Giannotti; Daniele Lotito; Dino Pedreschi

arXiv:2211.14067·cond-mat.dis-nn·July 4, 2023

Dense Hebbian neural networks: a replica symmetric picture of unsupervised learning

Elena Agliari, Linda Albanese, Francesco Alemanno, Andrea, Alessandrelli, Adriano Barra, Fosca Giannotti, Daniele Lotito, Dino Pedreschi

PDF

Open Access

TL;DR

This paper analyzes dense associative neural networks trained without supervision using statistical mechanics and simulations, revealing their performance and linking physical observables with machine learning loss functions.

Contribution

It provides a comprehensive phase diagram for unsupervised dense neural networks and introduces a novel analytical and computational framework combining Guerra's interpolation and Plefka approximation.

Findings

01

Identified performance regimes depending on dataset quality and size.

02

Established a connection between statistical mechanics observables and machine learning loss functions.

03

Developed a new approach combining large deviations, Guerra's interpolation, and Plefka approximation.

Abstract

We consider dense, associative neural-networks trained with no supervision and we investigate their computational capabilities analytically, via a statistical-mechanics approach, and numerically, via Monte Carlo simulations. In particular, we obtain a phase diagram summarizing their performance as a function of the control parameters such as the quality and quantity of the training dataset and the network storage, valid in the limit of large network size and structureless datasets. Moreover, we establish a bridge between macroscopic observables standardly used in statistical mechanics and loss functions typically used in the machine learning. As technical remarks, from the analytic side, we implement large deviations and stability analysis within Guerra's interpolation to tackle the not-Gaussian distributions involved in the post-synaptic potentials while, from the computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings