Sub-linear Regret in Adaptive Model Predictive Control

Damianos Tranos; Alexandre Proutiere

arXiv:2310.04842·eess.SY·October 10, 2023

Sub-linear Regret in Adaptive Model Predictive Control

Damianos Tranos, Alexandre Proutiere

PDF

Open Access

TL;DR

This paper introduces STT-MPC, an adaptive control algorithm for uncertain linear systems that achieves sub-linear regret growth, ensuring constraint satisfaction and stability while learning system dynamics online.

Contribution

The paper presents the first analysis of regret bounds for adaptive MPC with polytopic tubes, demonstrating sub-linear regret growth of O(T^{1/2 + ε}) in this setting.

Findings

01

Expected regret of STT-MPC is bounded by O(T^{1/2 + ε})

02

STT-MPC guarantees constraint satisfaction and stability

03

Numerical example illustrates effective learning and control

Abstract

We consider the problem of adaptive Model Predictive Control (MPC) for uncertain linear-systems with additive disturbances and with state and input constraints. We present STT-MPC (Self-Tuning Tube-based Model Predictive Control), an online algorithm that combines the certainty-equivalence principle and polytopic tubes. Specifically, at any given step, STT-MPC infers the system dynamics using the Least Squares Estimator (LSE), and applies a controller obtained by solving an MPC problem using these estimates. The use of polytopic tubes is so that, despite the uncertainties, state and input constraints are satisfied, and recursive-feasibility and asymptotic stability hold. In this work, we analyze the regret of the algorithm, when compared to an oracle algorithm initially aware of the system dynamics. We establish that the expected regret of STT-MPC does not exceed $O(T^{1/2 +…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Eicosanoids and Hypertension Pharmacology · Stability and Control of Uncertain Systems

MethodsAttentive Walk-Aggregating Graph Neural Network · Exponential Decay