Accelerated Optimization Landscape of Linear-Quadratic Regulator
Lechen Feng, Yuan-Hua Ni

TL;DR
This paper introduces accelerated optimization methods for solving the Linear-Quadratic Regulator (LQR) problem, providing convergence guarantees and improved complexity bounds for both state-feedback and output-feedback cases.
Contribution
It develops a first-order accelerated framework for LQR, including a hybrid dynamic system approach for SLQR and a Hessian-free method for OLQR, with theoretical convergence analysis.
Findings
Exponential convergence to optimal feedback gain in SLQR
Discretized Nesterov-type method preserves continuous-time convergence rate
OLQR method achieves $ ilde{O}( ext{poly}(1/ ext{epsilon}))$ complexity, better than vanilla gradient descent
Abstract
Linear-quadratic regulator (LQR) is a landmark problem in the field of optimal control, which is the concern of this paper. Generally, LQR is classified into state-feedback LQR (SLQR) and output-feedback LQR (OLQR) based on whether the full state is obtained. It has been suggested in existing literature that both SLQR and OLQR could be viewed as \textit{constrained nonconvex matrix optimization} problems in which the only variable to be optimized is the feedback gain matrix. In this paper, we introduce a first-order accelerated optimization framework of handling the LQR problem, and give its convergence analysis for the cases of SLQR and OLQR, respectively. Specifically, a Lipschiz Hessian property of LQR performance criterion is presented, which turns out to be a crucial property for the application of modern optimization techniques. For the SLQR problem, a continuous-time hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algorithms and Applications · Advanced Control Systems Optimization
