A System Level Approach to Regret Optimal Control
Alexandre Didier, Jerome Sieber, Melanie N. Zeilinger

TL;DR
This paper introduces a system-level, optimisation-based method for designing controllers that minimize dynamic regret in linear systems, providing a new approach that handles structured problems and guarantees constraints.
Contribution
It presents a novel semi-definite programming framework for dynamic regret optimal control using system level parametrisation, including structured problems and disturbance bounds.
Findings
Dynamic regret bounds can be improved with pointwise ellipsoidal disturbance bounds.
The optimal dynamic regret differs by at most 2/π from the computed bound.
The framework guarantees state and input constraint satisfaction.
Abstract
We present an optimisation-based method for synthesising a dynamic regret optimal controller for linear systems with potentially adversarial disturbances and known or adversarial initial conditions. The dynamic regret is defined as the difference between the true incurred cost of the system and the cost which could have optimally been achieved under any input sequence having full knowledge of all future disturbances for a given disturbance energy. This problem formulation can be seen as an alternative to classical - or -control. The proposed controller synthesis is based on the system level parametrisation, which allows reformulating the dynamic regret problem as a semi-definite problem. This yields a new framework that allows to consider structured dynamic regret problems, which have not yet been considered in the literature. For known pointwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research
