Regret-Guaranteed Safe Switching with Minimum Cost: LQR Setting with   Unknown Dynamics

Jafar Abbaszadeh Chekan; Cedric Langbort

arXiv:2302.02270·eess.SY·December 21, 2023

Regret-Guaranteed Safe Switching with Minimum Cost: LQR Setting with Unknown Dynamics

Jafar Abbaszadeh Chekan, Cedric Langbort

PDF

Open Access

TL;DR

This paper introduces a regret-minimizing control strategy for externally forced switched systems with unknown dynamics, ensuring safety through online learning of dwell times and achieving sublinear regret.

Contribution

It develops an online algorithm based on optimism in the face of uncertainty to learn dwell times and control policies for safe, cost-effective switching with unknown system parameters.

Findings

01

Expected regret of (|M| ns)

02

Effective online estimation of dwell times

03

Benchmarking shows improved performance over known-parameter scenarios

Abstract

Externally Forced Switched (EFS) systems represent a subset of switched systems where switches occur deliberately to meet an external requirement. However, fast switching can lead to instability, even when all closed-loop modes are stable. In this study, our focus is on an EFS scenario with \textit{unknown system dynamics}, where the next mode to switch to is revealed by an external entity in real-time as the switch occurs. The challenge is to track the revealed sequence while (1) minimizing accumulated cost in a regretful sense and (2) ensuring that the norm of the system's state does not grow excessively-a property we refer to as 'the safety of switching.' Achieving the latter involves requiring the closed-loop system to remain in each revealed mode for some minimum dwell time, which must be learned online. We propose an algorithm based on the principles of Optimism in the Face of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Fault Detection and Control Systems