Identification and Adaptive Control of Markov Jump Systems: Sample Complexity and Regret Bounds
Yahya Sattar, Zhe Du, Davoud Ataee Tarzanagh, Laura Balzano, Necmiye Ozay, Samet Oymak

TL;DR
This paper develops a sample-efficient adaptive control method for unknown Markov jump systems, providing theoretical guarantees on identification accuracy and regret bounds, with practical implications for controlling systems with changing dynamics.
Contribution
It introduces a novel identification algorithm for Markov jump systems and combines it with adaptive control to achieve near-optimal regret bounds, advancing control of systems with stochastic mode switching.
Findings
Sample complexity of identification is O(1/√T).
Adaptive control achieves O(√T) regret.
Regret can be reduced to polylogarithmic with partial system knowledge.
Abstract
Learning how to effectively control unknown dynamical systems is crucial for intelligent autonomous systems. This task becomes a significant challenge when the underlying dynamics are changing with time. Motivated by this challenge, this paper considers the problem of controlling an unknown Markov jump linear system (MJS) to optimize a quadratic objective. By taking a model-based perspective, we consider identification-based adaptive control of MJSs. We first provide a system identification algorithm for MJS to learn the dynamics in each mode as well as the Markov transition matrix, underlying the evolution of the mode switches, from a single trajectory of the system states, inputs, and modes. Through martingale-based arguments, sample complexity of this algorithm is shown to be . We then propose an adaptive control scheme that performs system identification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification · Fault Detection and Control Systems · Advanced Control Systems Optimization
