Meta-Learning Guarantees for Online Receding Horizon Learning Control

Deepan Muthirayan; Pramod P. Khargonekar

arXiv:2010.11327·eess.SY·November 2, 2022

Meta-Learning Guarantees for Online Receding Horizon Learning Control

Deepan Muthirayan, Pramod P. Khargonekar

PDF

Open Access

TL;DR

This paper establishes regret guarantees for an online meta-learning receding horizon control algorithm applied to unknown linear systems, demonstrating improved learning performance over multiple iterations.

Contribution

It provides the first provable regret bounds for meta-learning-based receding horizon control in iterative, unknown linear systems with constraints.

Findings

01

Achieves sub-linear regret with increasing iterations.

02

Regret bound improves as the number of iterations grows.

03

Guarantees rate of learning improvement over time.

Abstract

In this paper we provide provable regret guarantees for an online meta-learning receding horizon control algorithm in an iterative control setting. We consider the setting where, in each iteration the system to be controlled is a linear deterministic system that is different and unknown, the cost for the controller in an iteration is a general additive cost function and there are affine control input constraints. By analysing conditions under which sub-linear regret is achievable, we prove that the meta-learning online receding horizon controller achieves an average of the dynamic regret for the controller cost that is $\tilde{O} ((1 + 1/ N) T^{3/4})$ with the number of iterations $N$ . Thus, we show that the worst regret for learning within an iteration improves with experience of more iterations, with guarantee on rate of improvement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Control Systems Optimization · Adaptive Dynamic Programming Control