A Dynamical Systems Perspective on the Analysis of Neural Networks
Dennis Chemnitz, Maximilian Engel, Christian Kuehn, Sara-Viola Kuntz

TL;DR
This paper applies dynamical systems theory to analyze neural networks, covering information propagation, training dynamics, stability, and mean-field limits, offering new insights into neural network behavior and training phenomena.
Contribution
It introduces a dynamical systems framework for neural network analysis, including universal embedding, stability of gradient descent, and mean-field limits, advancing theoretical understanding.
Findings
Universal embedding property for neural ODEs
Stability analysis of gradient descent and overparameterized networks
Mean-field limits for heterogeneous neural networks
Abstract
In this chapter, we utilize dynamical systems to analyze several aspects of machine learning algorithms. As an expository contribution we demonstrate how to re-formulate a wide variety of challenges from deep neural networks, (stochastic) gradient descent, and related topics into dynamical statements. We also tackle three concrete challenges. First, we consider the process of information propagation through a neural network, i.e., we study the input-output map for different architectures. We explain the universal embedding property for augmented neural ODEs representing arbitrary functions of given regularity, the classification of multilayer perceptrons and neural ODEs in terms of suitable function classes, and the memory-dependence in neural delay equations. Second, we consider the training aspect of neural networks dynamically. We describe a dynamical systems perspective on gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
