Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Hongyu Zhou, Vasileios Tzoumas

TL;DR
This paper introduces an algorithm for simultaneous system identification and model predictive control of nonlinear systems, with guarantees of near-optimality and sublinear dynamic regret, applicable to systems with unknown, adaptive disturbances.
Contribution
The paper presents a self-supervised algorithm that combines system identification with model predictive control, providing finite-time guarantees and handling unknown, adaptive disturbances modeled in reproducing kernel Hilbert spaces.
Findings
Algorithm achieves sublinear dynamic regret.
Validated on hardware and simulations with unmodeled disturbances.
Successfully controls systems with unknown, adaptive dynamics.
Abstract
We provide an algorithm for the simultaneous system identification and model predictive control of nonlinear systems. The algorithm has finite-time near-optimality guarantees and asymptotically converges to the optimal (non-causal) controller. Particularly, the algorithm enjoys sublinear dynamic regret, defined herein as the suboptimality against an optimal clairvoyant controller that knows how the unknown disturbances and system dynamics will adapt to its actions. The algorithm is self-supervised and applies to control-affine systems with unknown dynamics and disturbances that can be expressed in reproducing kernel Hilbert spaces. Such spaces can model external disturbances and modeling errors that can even be adaptive to the system's state and control input. For example, they can model wind and wave disturbances to aerial and marine vehicles, or inaccurate model parameters such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Fault Detection and Control Systems
