Change Point Detection Approach for Online Control of Unknown Time   Varying Dynamical Systems

Deepan Muthirayan; Ruijie Du; Yanning Shen; and Pramod P. Khargonekar

arXiv:2210.11684·eess.SY·March 28, 2023

Change Point Detection Approach for Online Control of Unknown Time Varying Dynamical Systems

Deepan Muthirayan, Ruijie Du, Yanning Shen, and Pramod P. Khargonekar

PDF

Open Access

TL;DR

This paper introduces a change point detection method for online control of unknown, time-varying dynamical systems, achieving sub-linear regret and outperforming existing approaches in adaptive control scenarios.

Contribution

The paper presents the first regret guarantee for unknown time-varying systems using change point detection, applicable to general classes of systems with sub-linear changes.

Findings

01

Achieves a regret of $ ext{Gamma}_T^{1/5} T^{4/5}$ for systems with $ ext{Gamma}_T$ changes.

02

Demonstrates superior performance over restart and standard online learning methods.

03

Provides the first regret bounds based on the number of system changes.

Abstract

We propose a novel change point detection approach for online learning control with full information feedback (state, disturbance, and cost feedback) for unknown time-varying dynamical systems. We show that our algorithm can achieve a sub-linear regret with respect to the class of Disturbance Action Control (DAC) policies, which are a widely studied class of policies for online control of dynamical systems, for any sub-linear number of changes and very general class of systems: (i) matched disturbance system with general convex cost functions, (ii) general system with linear cost functions. Specifically, a (dynamic) regret of $Γ_{T}^{1/5} T^{4/5}$ can be achieved for these class of systems, where $Γ_{T}$ is the number of changes of the underlying system and $T$ is the duration of the control episode. That is, the change point detection approach achieves a sub-linear regret for any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReceptor Mechanisms and Signaling · Advanced Control Systems Optimization · Advanced Bandit Algorithms Research