Any-Time Regret-Guaranteed Algorithm for Control of Linear Quadratic Systems
Jafar Abbaszadeh Chekan, Cedric Langbort

TL;DR
This paper introduces a computationally efficient, anytime regret-guaranteed control algorithm for linear quadratic systems that adapts to system stability and removes the need for prior bounds on the Riccati solution.
Contribution
It presents two variants of an algorithm with explicit regret bounds, removing the need for prior Riccati bounds and improving stability and exploration-exploitation balance.
Findings
Achieves ((rac{1}{2}) t) regret with explicit system dependence.
Removes the requirement of a priori bounds on the Riccati solution.
Addresses stability issues in existing optimistic algorithms.
Abstract
We propose a computationally efficient algorithm that achieves anytime regret of order , with explicit dependence on the system dimensions and on the solution of the Discrete Algebraic Riccati Equation (DARE). Our approach builds on the SDP-based framework of \cite{cohen2019learning}, using an appropriately tuned regularization and a sufficiently accurate initial estimate to construct confidence ellipsoids for control design. A carefully designed input-perturbation mechanism is incorporated to ensure anytime performance. We develop two variants of the algorithm. The first enforces a notion of strong sequential stability, requiring each policy to be stabilizing and successive policies to remain close. However, enforcing this notion results in a suboptimal regret scaling. The second removes the sequential-stability requirement and instead requires only that each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAerospace Engineering and Control Systems · Advanced Algorithms and Applications · Advanced Research in Science and Engineering
