Regret-Optimal LQR Control

Oron Sabag; Gautam Goel; Sahin Lale; Babak Hassibi

arXiv:2105.01244·math.OC·April 14, 2023

Regret-Optimal LQR Control

Oron Sabag, Gautam Goel, Sahin Lale, Babak Hassibi

PDF

Open Access

TL;DR

This paper introduces a regret-optimal controller for infinite-horizon LQR problems, minimizing worst-case regret against a clairvoyant controller with future disturbance knowledge, and provides explicit formulas for its design.

Contribution

It formulates a novel regret-based criterion for LQR control, deriving explicit solutions that interpolate between $H_2$ and $H_$ optimal controllers.

Findings

01

The regret-optimal controller is linear and explicitly constructed from standard LQR solutions.

02

Simulations show the controller balances $H_2$ and $H_$ costs effectively.

03

Explicit formulas for the optimal regret and controller are derived via a Nehari extension approach.

Abstract

We consider the infinite-horizon LQR control problem. Motivated by competitive analysis in online learning, as a criterion for controller design we introduce the dynamic regret, defined as the difference between the LQR cost of a causal controller (that has only access to past disturbances) and the LQR cost of the \emph{unique} clairvoyant one (that has also access to future disturbances) that is known to dominate all other controllers. The regret itself is a function of the disturbances, and we propose to find a causal controller that minimizes the worst-case regret over all bounded energy disturbances. The resulting controller has the interpretation of guaranteeing the smallest regret compared to the best non-causal controller that can see the future. We derive explicit formulas for the optimal regret and for the regret-optimal controller for the state-space setting. These explicit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management