Regret Analysis of Distributed Online Control for LTI Systems with   Adversarial Disturbances

Ting-Jui Chang; Shahin Shahrampour

arXiv:2310.03206·math.OC·October 6, 2023

Regret Analysis of Distributed Online Control for LTI Systems with Adversarial Disturbances

Ting-Jui Chang, Shahin Shahrampour

PDF

Open Access

TL;DR

This paper develops distributed online control algorithms for LTI systems under adversarial disturbances, providing regret bounds for both known and unknown dynamics scenarios, advancing the robustness and learning capabilities of multi-agent control systems.

Contribution

It introduces a fully distributed disturbance feedback controller with regret bounds for known dynamics and a novel explore-then-commit approach for unknown dynamics in adversarial settings.

Findings

01

Regret bound of $O(\sqrt{T}\log T)$ for known dynamics.

02

Regret bound of $O(T^{2/3} ext{poly}(\log T))$ for unknown dynamics.

03

Effective distributed control strategies under adversarial disturbances.

Abstract

This paper addresses the distributed online control problem over a network of linear time-invariant (LTI) systems (with possibly unknown dynamics) in the presence of adversarial perturbations. There exists a global network cost that is characterized by a time-varying convex function, which evolves in an adversarial manner and is sequentially and partially observed by local agents. The goal of each agent is to generate a control sequence that can compete with the best centralized control policy in hindsight, which has access to the global cost. This problem is formulated as a regret minimization. For the case of known dynamics, we propose a fully distributed disturbance feedback controller that guarantees a regret bound of $O (T lo g T)$ , where $T$ is the time horizon. For the unknown dynamics case, we design a distributed explore-then-commit approach, where in the exploration phase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control