High-dimensional learning of narrow neural networks

Hugo Cui

arXiv:2409.13904·stat.ML·January 16, 2025

High-dimensional learning of narrow neural networks

Hugo Cui

PDF

Open Access

TL;DR

This paper reviews recent advances in the theoretical understanding of high-dimensional neural network learning using statistical physics, introducing a unified model framework and detailed analysis techniques.

Contribution

It presents a comprehensive review of statistical physics methods applied to neural networks, introducing the sequence multi-index model as a unifying framework.

Findings

01

Unified analysis of various neural network models

02

Application of replica method and message-passing algorithms

03

Insights into high-dimensional learning efficiency

Abstract

Recent years have been marked with the fast-pace diversification and increasing ubiquity of machine learning applications. Yet, a firm theoretical understanding of the surprising efficiency of neural networks to learn from high-dimensional data still proves largely elusive. In this endeavour, analyses inspired by statistical physics have proven instrumental, enabling the tight asymptotic characterization of the learning of neural networks in high dimensions, for a broad class of solvable models. This manuscript reviews the tools and ideas underlying recent progress in this line of work. We introduce a generic model -- the sequence multi-index model -- which encompasses numerous previously studied models as special instances. This unified framework covers a broad class of machine learning architectures with a finite number of hidden units, including multi-layer perceptrons, autoencoders,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSoftmax · Attention Is All You Need