Sub-linear Regret in Adaptive Model Predictive Control
Damianos Tranos, Alexandre Proutiere

TL;DR
This paper introduces STT-MPC, an adaptive control algorithm for uncertain linear systems that achieves sub-linear regret growth, ensuring constraint satisfaction and stability while learning system dynamics online.
Contribution
The paper presents the first analysis of regret bounds for adaptive MPC with polytopic tubes, demonstrating sub-linear regret growth of O(T^{1/2 + ε}) in this setting.
Findings
Expected regret of STT-MPC is bounded by O(T^{1/2 + ε})
STT-MPC guarantees constraint satisfaction and stability
Numerical example illustrates effective learning and control
Abstract
We consider the problem of adaptive Model Predictive Control (MPC) for uncertain linear-systems with additive disturbances and with state and input constraints. We present STT-MPC (Self-Tuning Tube-based Model Predictive Control), an online algorithm that combines the certainty-equivalence principle and polytopic tubes. Specifically, at any given step, STT-MPC infers the system dynamics using the Least Squares Estimator (LSE), and applies a controller obtained by solving an MPC problem using these estimates. The use of polytopic tubes is so that, despite the uncertainties, state and input constraints are satisfied, and recursive-feasibility and asymptotic stability hold. In this work, we analyze the regret of the algorithm, when compared to an oracle algorithm initially aware of the system dynamics. We establish that the expected regret of STT-MPC does not exceed $O(T^{1/2 +…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Eicosanoids and Hypertension Pharmacology · Stability and Control of Uncertain Systems
MethodsAttentive Walk-Aggregating Graph Neural Network · Exponential Decay
