A System Level Approach to Regret Optimal Control

Alexandre Didier; Jerome Sieber; Melanie N. Zeilinger

arXiv:2202.13763·eess.SY·May 31, 2022

A System Level Approach to Regret Optimal Control

Alexandre Didier, Jerome Sieber, Melanie N. Zeilinger

PDF

Open Access

TL;DR

This paper introduces a system-level, optimisation-based method for designing controllers that minimize dynamic regret in linear systems, providing a new approach that handles structured problems and guarantees constraints.

Contribution

It presents a novel semi-definite programming framework for dynamic regret optimal control using system level parametrisation, including structured problems and disturbance bounds.

Findings

01

Dynamic regret bounds can be improved with pointwise ellipsoidal disturbance bounds.

02

The optimal dynamic regret differs by at most 2/π from the computed bound.

03

The framework guarantees state and input constraint satisfaction.

Abstract

We present an optimisation-based method for synthesising a dynamic regret optimal controller for linear systems with potentially adversarial disturbances and known or adversarial initial conditions. The dynamic regret is defined as the difference between the true incurred cost of the system and the cost which could have optimally been achieved under any input sequence having full knowledge of all future disturbances for a given disturbance energy. This problem formulation can be seen as an alternative to classical $H_{2}$ - or $H_{\infty}$ -control. The proposed controller synthesis is based on the system level parametrisation, which allows reformulating the dynamic regret problem as a semi-definite problem. This yields a new framework that allows to consider structured dynamic regret problems, which have not yet been considered in the literature. For known pointwise…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research