On the relation between dynamic regret and closed-loop stability

Marko Nonhoff; Matthias A. Müller

arXiv:2209.05964 · math.OC · June 16, 2023 · Syst. Control Lett. · 1 citation

Open Access

TL;DR

This paper explores the connection between bounded dynamic regret and closed-loop stability, establishing conditions under which they imply each other in systems with unknown, time-varying costs.

Contribution

The paper provides the first formal analysis linking bounded dynamic regret with asymptotic stability of the closed loop in control systems with a priori unknown, time-varying cost functions.

Findings

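01. Bounded dynamic regret implies asymptotic stability of the optimal steady state when the cost function is constant.

02. For an asymptotically stable closed loop, a necessary condition for achieving bounded dynamic regret is derived.

03. Under additional assumptions on the system and the cost functions, a sufficient condition ensuring bounded dynamic regret is provided.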

Abstract

In this work, we study the relations between bounded dynamic regret and the classical notion of asymptotic stability for the case of a priori unknown and time-varying cost functions. In particular, we show that bounded dynamic regret implies asymptotic stability of the optimal steady state for a constant cost function. For the case of an asymptotically stable closed loop, we first derive a necessary condition for achieving bounded dynamic regret. Then, given some additional assumptions on the system and the cost functions, we also provide a sufficient condition ensuring bounded dynamic regret. Our results are illustrated by examples.
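
The first result can be illustrated numerically with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: a scalar linear system x(t+1) = a*x(t) + b*u(t), a constant quadratic cost whose optimal steady state (x_ref, u_ref) has cost zero, a simple state-feedback law, and dynamic regret measured as the accumulated closed-loop cost minus the cost of the optimal steady state at each step. The function name simulate_regret and all parameter values are hypothetical.

```python
def simulate_regret(gain, offset, horizon, a=0.5, b=1.0, x_ref=1.0, x0=5.0):
    """Simulate x(t+1) = a*x(t) + b*u(t) and accumulate regret.

    Assumed (illustrative) constant stage cost:
        L(x, u) = (x - x_ref)**2 + (u - u_ref)**2
    where (x_ref, u_ref) is a steady state of the system, so the
    optimal steady-state cost is zero and the regret equals the
    accumulated closed-loop cost.
    """
    # Input that keeps the system at x_ref: x_ref = a*x_ref + b*u_ref.
    u_ref = (1.0 - a) * x_ref / b
    x = x0
    regret = 0.0
    history = []
    for _ in range(horizon):
        # State feedback around the optimal steady state; a nonzero
        # offset shifts the closed-loop equilibrium away from it.
        u = u_ref + gain * (x - x_ref) + offset
        regret += (x - x_ref) ** 2 + (u - u_ref) ** 2
        history.append(regret)
        x = a * x + b * u
    return x, history


if __name__ == "__main__":
    for label, offset in (("no offset", 0.0), ("offset 0.4", 0.4)):
        x_end, reg = simulate_regret(gain=-0.3, offset=offset, horizon=200)
        print(label, "final state:", x_end,
              "regret at t=100:", reg[99], "regret at t=200:", reg[199])
```

In this sketch, the controller without offset drives the state to x_ref and its regret levels off, while the controller with offset settles at a different steady state and its regret keeps growing roughly linearly. This is consistent with the direction stated in the abstract (bounded dynamic regret requires convergence to the optimal steady state for a constant cost), but it is only an example and not a substitute for the paper's proof.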

Taxonomy

Topics: Advanced Bandit Algorithms Research