A Dynamical Systems Perspective on the Analysis of Neural Networks

Dennis Chemnitz; Maximilian Engel; Christian Kuehn; Sara-Viola Kuntz

arXiv:2507.05164·math.DS·July 8, 2025

A Dynamical Systems Perspective on the Analysis of Neural Networks

Dennis Chemnitz, Maximilian Engel, Christian Kuehn, Sara-Viola Kuntz

PDF

TL;DR

This paper applies dynamical systems theory to analyze neural networks, covering information propagation, training dynamics, stability, and mean-field limits, offering new insights into neural network behavior and training phenomena.

Contribution

It introduces a dynamical systems framework for neural network analysis, including universal embedding, stability of gradient descent, and mean-field limits, advancing theoretical understanding.

Findings

01

Universal embedding property for neural ODEs

02

Stability analysis of gradient descent and overparameterized networks

03

Mean-field limits for heterogeneous neural networks

Abstract

In this chapter, we utilize dynamical systems to analyze several aspects of machine learning algorithms. As an expository contribution we demonstrate how to re-formulate a wide variety of challenges from deep neural networks, (stochastic) gradient descent, and related topics into dynamical statements. We also tackle three concrete challenges. First, we consider the process of information propagation through a neural network, i.e., we study the input-output map for different architectures. We explain the universal embedding property for augmented neural ODEs representing arbitrary functions of given regularity, the classification of multilayer perceptrons and neural ODEs in terms of suitable function classes, and the memory-dependence in neural delay equations. Second, we consider the training aspect of neural networks dynamically. We describe a dynamical systems perspective on gradient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.