Regret Analysis of Online Gradient Descent-based Iterative Learning   Control with Model Mismatch

Efe C. Balta; Andrea Iannelli; Roy S. Smith; John Lygeros

arXiv:2204.04722·eess.SY·April 12, 2022

Regret Analysis of Online Gradient Descent-based Iterative Learning Control with Model Mismatch

Efe C. Balta, Andrea Iannelli, Roy S. Smith, John Lygeros

PDF

TL;DR

This paper analyzes the performance of online gradient descent in iterative learning control with model mismatch, focusing on regret bounds and limitations, supported by numerical simulations.

Contribution

It introduces a regret-based analysis framework for ILC with model mismatch using online learning concepts, highlighting fundamental limitations and potential integration with adaptation.

Findings

01

Performance bounds for online gradient descent in ILC

02

Identification of fundamental limitations of the scheme

03

Numerical validation on benchmark ILC problem

Abstract

In Iterative Learning Control (ILC), a sequence of feedforward control actions is generated at each iteration on the basis of partial model knowledge and past measurements with the goal of steering the system toward a desired reference trajectory. This is framed here as an online learning task, where the decision-maker takes sequential decisions by solving a sequence of optimization problems having only partial knowledge of the cost functions. Having established this connection, the performance of an online gradient-descent based scheme using inexact gradient information is analyzed in the setting of dynamic and static regret, standard measures in online learning. Fundamental limitations of the scheme and its integration with adaptation mechanisms are further investigated, followed by numerical simulations on a benchmark ILC problem.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.