Exact full-RSB SAT/UNSAT transition in infinitely wide two-layer neural networks
Brandon L. Annesi, Enrico M. Malatesta, Francesco Zamponi

TL;DR
This paper provides an exact analysis of the SAT/UNSAT transition in infinitely wide two-layer neural networks, revealing the limitations of gradient descent and AMP algorithms in reaching maximal capacity due to complex state overlaps.
Contribution
It introduces a full-RSB analysis for the capacity transition in continuous neural network models and uncovers the impact of overlap gaps on algorithmic performance.
Findings
Exact SAT/UNSAT transition computed using full-RSB ansatz
Overlap gap in the negative perceptron model affects algorithm convergence
Gradient Descent cannot reach maximal capacity, independent of overlap gaps
Abstract
We analyze the problem of storing random pattern-label associations using two classes of continuous non-convex weights models, namely the perceptron with negative margin and an infinite-width two-layer neural network with non-overlapping receptive fields and generic activation function. Using a full-RSB ansatz we compute the exact value of the SAT/UNSAT transition. Furthermore, in the case of the negative perceptron we show that the overlap distribution of typical states displays an overlap gap (a disconnected support) in certain regions of the phase diagram defined by the value of the margin and the density of patterns to be stored. This implies that some recent theorems that ensure convergence of Approximate Message Passing (AMP) based algorithms to capacity are not applicable. Finally, we show that Gradient Descent is not able to reach the maximal capacity, irrespectively of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Reservoir Engineering and Simulation Methods · Neural Networks and Applications
