Regret-Guaranteed Safe Switching with Minimum Cost: LQR Setting with Unknown Dynamics
Jafar Abbaszadeh Chekan, Cedric Langbort

TL;DR
This paper introduces a regret-minimizing control strategy for externally forced switched systems with unknown dynamics, ensuring safety through online learning of dwell times and achieving sublinear regret.
Contribution
It develops an online algorithm based on optimism in the face of uncertainty to learn dwell times and control policies for safe, cost-effective switching with unknown system parameters.
Findings
Expected regret of (|M| ns)
Effective online estimation of dwell times
Benchmarking shows improved performance over known-parameter scenarios
Abstract
Externally Forced Switched (EFS) systems represent a subset of switched systems where switches occur deliberately to meet an external requirement. However, fast switching can lead to instability, even when all closed-loop modes are stable. In this study, our focus is on an EFS scenario with \textit{unknown system dynamics}, where the next mode to switch to is revealed by an external entity in real-time as the switch occurs. The challenge is to track the revealed sequence while (1) minimizing accumulated cost in a regretful sense and (2) ensuring that the norm of the system's state does not grow excessively-a property we refer to as 'the safety of switching.' Achieving the latter involves requiring the closed-loop system to remain in each revealed mode for some minimum dwell time, which must be learned online. We propose an algorithm based on the principles of Optimism in the Face of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Fault Detection and Control Systems
