Optimal Tracking Control for Unknown Linear Systems with Finite-Time Parameter Estimation
Shengbo Wang, Shiping Wen, Kaibo Shi, Song Zhu, Tingwen Huang

TL;DR
This paper develops a finite-time parameter estimation method for unknown linear systems and applies it to optimal tracking control, demonstrating effectiveness through simulations.
Contribution
It introduces an improved DREM-based parameter estimation method combined with gradient descent for optimal tracking in unknown linear systems.
Findings
Finite-time parameter estimation achieved for unknown system matrices.
Effective tracking control demonstrated through simulations.
Method ensures existence of optimal solutions for heterogeneous systems.
Abstract
The optimal control input for linear systems can be solved from algebraic Riccati equation (ARE), from which it remains questionable to get the form of the exact solution. In engineering, the acceptable numerical solutions of ARE can be found by iteration or optimization. Recently, the gradient descent based numerical solutions has been proven effective to approximate the optimal ones. This paper introduces this method to tracking problem for heterogeneous linear systems. Differently, the parameters in the dynamics of the linear systems are all assumed to be unknown, which is intractable since the gradient as well as the allowable initialization needs the prior knowledge of system dynamics. To solve this problem, the method named dynamic regressor extension and mix (DREM) is improved to estimate the parameter matrices in finite time. Besides, a discounted factor is introduced to ensure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Adaptive Control of Nonlinear Systems · Stability and Control of Uncertain Systems
