Change Point Detection Approach for Online Control of Unknown Time Varying Dynamical Systems
Deepan Muthirayan, Ruijie Du, Yanning Shen, and Pramod P. Khargonekar

TL;DR
This paper introduces a change point detection method for online control of unknown, time-varying dynamical systems, achieving sub-linear regret and outperforming existing approaches in adaptive control scenarios.
Contribution
The paper presents the first regret guarantee for unknown time-varying systems using change point detection, applicable to general classes of systems with sub-linear changes.
Findings
Achieves a regret of $ ext{Gamma}_T^{1/5} T^{4/5}$ for systems with $ ext{Gamma}_T$ changes.
Demonstrates superior performance over restart and standard online learning methods.
Provides the first regret bounds based on the number of system changes.
Abstract
We propose a novel change point detection approach for online learning control with full information feedback (state, disturbance, and cost feedback) for unknown time-varying dynamical systems. We show that our algorithm can achieve a sub-linear regret with respect to the class of Disturbance Action Control (DAC) policies, which are a widely studied class of policies for online control of dynamical systems, for any sub-linear number of changes and very general class of systems: (i) matched disturbance system with general convex cost functions, (ii) general system with linear cost functions. Specifically, a (dynamic) regret of can be achieved for these class of systems, where is the number of changes of the underlying system and is the duration of the control episode. That is, the change point detection approach achieves a sub-linear regret for any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReceptor Mechanisms and Signaling · Advanced Control Systems Optimization · Advanced Bandit Algorithms Research
