Jamming in multilayer supervised learning models

Silvio Franz; Sungmin Hwang; Pierfrancesco Urbani

arXiv:1809.09945·cond-mat.dis-nn·October 23, 2019

Jamming in multilayer supervised learning models

Silvio Franz, Sungmin Hwang, Pierfrancesco Urbani

PDF

TL;DR

This paper explores the universality of jamming transitions in multilayer neural networks, revealing conditions under which they exhibit sphere-like universality classes and proposing a dimensional reduction mechanism for finite-dimensional cases.

Contribution

It introduces a mean-field framework for multilayer neural networks in jamming, identifying a dimensional reduction that links their behavior to infinite-dimensional spheres.

Findings

01

Jamming in multilayer networks can recover sphere universality when isostatic.

02

Exact mean-field equations reveal a dimensional reduction mechanism.

03

The mechanism may explain universality in finite-dimensional systems.

Abstract

Critical jamming transitions are characterized by an astonishing degree of universality. Analytic and numerical evidence points to the existence of a large universality class that encompasses finite and infinite dimensional spheres and continuous constraint satisfaction problems (CCSP) such as the non-convex perceptron and related models. In this paper we investigate multilayer neural networks (MLNN) learning random associations as models for CCSP which could potentially define different jamming universality classes. As opposed to simple perceptrons and infinite dimensional spheres, which are described by a single effective field in terms of which the constraints appear to be one-dimensional, the description of MLNN, involves multiple fields, and the constraints acquire a multidimensional character. We first study the models numerically and show that similarly to the perceptron,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.