Convergence Analysis of Gradient Flow for Overparameterized LQR Formulations
Arthur Castello B. de Oliveira, Milad Siami, Eduardo D. Sontag

TL;DR
This paper provides a theoretical analysis of gradient flow convergence in overparameterized neural network-based LQR control, demonstrating conditions for optimality, stability, and accelerated convergence through both proofs and numerical experiments.
Contribution
It introduces a novel convergence analysis for overparameterized neural network controllers in LQR problems, including proofs of optimality and stability, and shows empirical acceleration in training.
Findings
Conservation law of the system's gradient structure.
Global convergence to optimal feedback for single hidden layer networks.
Empirical evidence of faster convergence with overparameterization.
Abstract
Motivated by the growing use of artificial intelligence (AI) tools in control design, this paper analyses the intersection between results from gradient methods for the model-free linear quadratic regulator (LQR), and linear feedforward neural networks (LFFNNs), More specifically, it looks into the case where one wants to find a LFFNN feedback that minimizes a LQR cost. This paper starts by analyzing the structure of the gradient expression for the parameters of each layer, which implies a key conservation law of the system. This conservation law is then leveraged to generalize existing results on boundedness and global convergence of solutions to critical points, and invariance of the set of stabilizing networks under the training dynamics. This is followed by an analysis of the case where the LFFNN has a single hidden layer, for which the paper proves that the training converges not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Control Systems and Identification · Matrix Theory and Algorithms
MethodsSparse Evolutionary Training
