SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures

Julian Kranz; Davide Gallon; Steffen Dereich; Arnulf Jentzen

arXiv:2505.09572·cs.LG·January 13, 2026

SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures

Julian Kranz, Davide Gallon, Steffen Dereich, Arnulf Jentzen

PDF

Open Access 1 Repo

TL;DR

This paper analyzes the behavior of gradient flows in neural networks with common activation functions, showing conditions for convergence or divergence, and establishing asymptotic optimality using o-minimal structures, supported by numerical experiments.

Contribution

It introduces a rigorous geometric framework using o-minimal structures to analyze gradient flow dynamics and proves divergence or convergence properties in neural network training.

Findings

01

Gradient flows either converge to critical points or diverge to infinity.

02

A threshold exists below which the loss converges to the optimal value.

03

For large architectures and data, the optimal loss is asymptotically zero.

Abstract

We study gradient flows for loss landscapes of fully connected feedforward neural networks with commonly used continuously differentiable activation functions such as the logistic, hyperbolic tangent, softplus or GELU function. We prove that the gradient flow either converges to a critical point or diverges to infinity while the loss converges to an asymptotic critical value. Moreover, we prove the existence of a threshold $ε > 0$ such that the loss value of any gradient flow initialized at most $ε$ above the optimal level converges to it. For polynomial target functions and sufficiently big architecture and data set, we prove that the optimal loss value is zero and can only be realized asymptotically. From this setting, we deduce our main result that any gradient flow with sufficiently good initialization diverges to infinity. Our proof heavily relies on the geometry…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deeplearningmethods/sad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · (TravEL!!Guide)How Do I File a Claim with Expedia?