A gradient flow on control space with rough initial condition
Paul Gassiat, Florin Suciu

TL;DR
This paper investigates a gradient flow approach in control space with rough initial conditions, showing convergence properties and implications for neural network training, especially when initializations are irregular or stochastic.
Contribution
It introduces a novel analysis of gradient flows with rough initial conditions using Lyons' rough path theory, demonstrating saddle point behavior and convergence in specific cases.
Findings
Saddle points are moved to infinity with rough initializations.
Gradient flow converges to solutions when initial conditions are Brownian paths.
The approach has implications for training deep neural networks with limited parameters.
Abstract
We consider the (sub-Riemannian type) control problem of finding a path going from an initial point to a target point , by only moving in certain admissible directions. We assume that the corresponding vector fields satisfy the bracket-generating (H\"ormander) condition, so that the classical Chow-Rashevskii theorem guarantees the existence of such a path. One natural way to try to solve this problem is via a gradient flow on control space. However, since the corresponding dynamics may have saddle points, any convergence result must rely on suitable (e.g. random) initialisation. We consider the case when this initialisation is irregular, which is conveniently formulated via Lyons' rough path theory. We show that one advantage of this initialisation is that the saddle points are moved to infinity, while minima remain at a finite distance from the starting point. In the step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
