Learn and Control while Switching: with Guaranteed Stability and   Sublinear Regret

Jafar Abbaszadeh Chekan; C\'edric Langbort

arXiv:2207.10827·eess.SY·October 4, 2023

Learn and Control while Switching: with Guaranteed Stability and Sublinear Regret

Jafar Abbaszadeh Chekan, C\'edric Langbort

PDF

Open Access

TL;DR

This paper introduces a novel learning and control algorithm for over-actuated systems that switch between actuators, ensuring stability and achieving sublinear regret in unknown system settings.

Contribution

It proposes an OFU-based algorithm with stability guarantees and an optimal warm-up duration for switching actuators in linear quadratic systems.

Findings

01

Guarantees system stability during switching.

02

Achieves sublinear regret of order √T.

03

Outperforms naive OFU approaches.

Abstract

Over-actuated systems often make it possible to achieve specific performances by switching between different subsets of actuators. However, when the system parameters are unknown, transferring authority to different subsets of actuators is challenging due to stability and performance efficiency concerns. This paper presents an efficient algorithm to tackle the so-called "learn and control while switching between different actuating modes" problem in the Linear Quadratic (LQ) setting. Our proposed strategy is constructed upon Optimism in the Face of Uncertainty (OFU) based algorithm equipped with a projection toolbox to keep the algorithm efficient, regret-wise. Along the way, we derive an optimum duration for the warm-up phase, thanks to the existence of a stabilizing neighborhood. The stability of the switched system is also guaranteed by designing a minimum average dwell time. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Control Systems and Identification · Reservoir Engineering and Simulation Methods