Glassy dynamics in deep neural networks: A structural comparison
Max Kerr Winter, Liesbeth M. C. Janssen

TL;DR
This paper explores the glass-like dynamics of deep neural networks, revealing both similarities and differences with structural glasses through quantitative analysis of trained models and their phase transitions.
Contribution
It provides the first detailed comparison of neural network dynamics with glass phenomenology, identifying phase transitions and analyzing their dynamic behavior.
Findings
Existence of a Topology Trivialisation Transition.
DNNs do not show diverging relaxation times at non-zero temperature.
Weight overlap follows a power law consistent with Mode-Coupling Theory.
Abstract
Deep Neural Networks (DNNs) share important similarities with structural glasses. Both have many degrees of freedom, and their dynamics are governed by a high-dimensional, non-convex landscape representing either the loss or energy, respectively. Furthermore, both experience gradient descent dynamics subject to noise. In this work we investigate, by performing quantitative measurements on realistic networks trained on the MNIST and CIFAR-10 datasets, the extent to which this qualitative similarity gives rise to glass-like dynamics in neural networks. We demonstrate the existence of a Topology Trivialisation Transition as well as the previously studied under-to-overparameterised transition analogous to jamming. By training DNNs with overdamped Langevin dynamics in the resulting disordered phases, we do not observe diverging relaxation times at non-zero temperature, nor do we observe any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCultural Heritage Materials Analysis
