Optimization Landscape of Gradient Descent for Discrete-time Static Output Feedback

Jingliang Duan; Jie Li; Shengbo Eben Li; Lin Zhao

arXiv:2109.13132 · math.OC · October 31, 2023 · 1 citation

TL;DR

This paper investigates the optimization landscape of gradient descent for static output feedback control in discrete-time systems, establishing convergence properties and local optimality conditions.

Contribution

It provides a detailed analysis of the cost function's properties and proves convergence and local optimality results for gradient descent in SOF control.

Findings

01  Gradient descent converges to stationary points at a dimension-free rate.

02  Under mild conditions, gradient descent converges linearly to a local minimum.

03  The results shed light on policy gradient methods in reinforcement learning.
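
For orientation, the two convergence claims above have the same shape as the standard textbook bounds for gradient descent on an L-smooth function. The block below is a sketch of those generic bounds only, not the paper's theorems: the symbols J, K_k, J^\ast, K^\ast and \rho are notation assumed here, and the paper's exact constants and conditions are not given on this page. The first bound is called dimension-free because the problem dimension does not appear in it explicitly.

```latex
% Generic bounds for an L-smooth cost J (assumed notation, not the paper's statements).
\begin{align}
  K_{k+1} &= K_k - \tfrac{1}{L}\,\nabla J(K_k)
    && \text{gradient step with step size } 1/L \\
  J(K_{k+1}) &\le J(K_k) - \tfrac{1}{2L}\,\lVert \nabla J(K_k) \rVert_F^2
    && \text{descent lemma} \\
  \min_{0 \le k < T} \lVert \nabla J(K_k) \rVert_F^2
    &\le \frac{2L\,\bigl(J(K_0) - J^\ast\bigr)}{T}
    && \text{sum the descent lemma over } k,\ J^\ast = \inf_K J(K) \\
  J(K_k) - J(K^\ast) &\le \rho^{\,k}\,\bigl(J(K_0) - J(K^\ast)\bigr),
    \quad 0 < \rho < 1
    && \text{linear rate near a local minimum } K^\ast
\end{align}
```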

Abstract

In this paper, we analyze the optimization landscape of gradient descent methods for static output feedback (SOF) control of discrete-time linear time-invariant systems with quadratic cost. The SOF setting is common, for example, when there are unmodeled hidden states in the underlying process. We first establish several important properties of the SOF cost function, including coercivity, L-smoothness, and an M-Lipschitz continuous Hessian. We then use these properties to show that gradient descent converges to a stationary point at a dimension-free rate. Furthermore, we prove that under some mild conditions, gradient descent converges linearly to a local minimum if the starting point is close to one. These results not only characterize the performance of gradient descent in optimizing the SOF problem, but also shed light on the efficiency of general policy…
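
To make the setting concrete, here is a minimal Python sketch of gradient descent on an SOF cost. It is an illustration under stated assumptions, not the authors' implementation: it assumes dynamics x_{t+1} = A x_t + B u_t, output y_t = C x_t, control law u_t = -K y_t, and cost J(K) = trace(P_K X0), where X0 is the covariance of the initial state. The gradient is the standard LQR policy-gradient expression specialized to output feedback. The helper names (`dlyap`, `sof_cost_and_grad`, `gradient_descent`), the example matrices, and the step-halving rule are all hypothetical choices made for this sketch; the abstract describes plain gradient descent.

```python
import numpy as np


def dlyap(M, W):
    """Solve X = M X M^T + W via Kronecker vectorization (small systems only)."""
    n = M.shape[0]
    x = np.linalg.solve(np.eye(n * n) - np.kron(M, M), W.reshape(-1))
    return x.reshape(n, n)


def sof_cost_and_grad(K, A, B, C, Q, R, X0):
    """Return (J(K), grad J(K)) for u_t = -K y_t, or (inf, None) if unstable."""
    A_K = A - B @ K @ C  # closed-loop matrix
    if np.max(np.abs(np.linalg.eigvals(A_K))) >= 1.0:
        return np.inf, None  # the cost is infinite outside the stabilizing set
    # P solves P = A_K^T P A_K + Q + C^T K^T R K C
    P = dlyap(A_K.T, Q + C.T @ K.T @ R @ K @ C)
    # Sigma solves Sigma = A_K Sigma A_K^T + X0 (accumulated state covariance)
    Sigma = dlyap(A_K, X0)
    E = (R + B.T @ P @ B) @ K @ C - B.T @ P @ A
    return float(np.trace(P @ X0)), 2.0 * E @ Sigma @ C.T


def gradient_descent(K0, A, B, C, Q, R, X0, step=0.1, iters=500, tol=1e-8):
    """Gradient descent with step halving (an assumption of this sketch)."""
    K = K0.copy()
    J, g = sof_cost_and_grad(K, A, B, C, Q, R, X0)
    history = [J]
    for _ in range(iters):
        if np.linalg.norm(g) < tol:
            break
        eta = step
        while True:
            K_new = K - eta * g
            J_new, g_new = sof_cost_and_grad(K_new, A, B, C, Q, R, X0)
            if J_new < J:  # accept only stabilizing, cost-decreasing steps
                break
            eta *= 0.5
            if eta < 1e-12:  # no decrease found: treat K as numerically stationary
                return K, history
        K, J, g = K_new, J_new, g_new
        history.append(J)
    return K, history


# Hypothetical 2-state, 1-input, 1-output example; A is stable, so K0 = 0 is feasible.
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
Q, R, X0 = np.eye(2), np.eye(1), np.eye(2)
K0 = np.zeros((1, 1))

K_final, history = gradient_descent(K0, A, B, C, Q, R, X0)
print("initial cost:", history[0])
print("final cost:  ", history[-1])
print("final gain:  ", K_final)
```

Because the cost is infinite whenever the closed loop is unstable, accepting only steps that strictly decrease the cost keeps every iterate inside the stabilizing set, which is the practical role coercivity plays in the analysis. The Kronecker-based Lyapunov solve is chosen for self-containment and scales poorly; a dedicated solver would be preferable for larger systems.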

Taxonomy

Topics: Adaptive Dynamic Programming Control · Advanced Control Systems Optimization · Reinforcement Learning in Robotics