Anticipating the Long-Term Effect of Online Learning in Control

Alexandre Capone; Sandra Hirche

arXiv:2007.12377·cs.LG·July 27, 2020

Anticipating the Long-Term Effect of Online Learning in Control

Alexandre Capone, Sandra Hirche

PDF

TL;DR

This paper introduces AntLer, a control design algorithm that explicitly anticipates future learning effects in uncertain systems, leading to improved control performance by optimizing expected costs.

Contribution

AntLer is a novel algorithm that incorporates future learning effects into control law design using probabilistic models and approximate optimization.

Findings

01

AntLer approximates optimal solutions with high probability.

02

Applying AntLer to nonlinear systems improves control outcomes.

03

AntLer explicitly accounts for future learning in control synthesis.

Abstract

Control schemes that learn using measurement data collected online are increasingly promising for the control of complex and uncertain systems. However, in most approaches of this kind, learning is viewed as a side effect that passively improves control performance, e.g., by updating a model of the system dynamics. Determining how improvements in control performance due to learning can be actively exploited in the control synthesis is still an open research question. In this paper, we present AntLer, a design algorithm for learning-based control laws that anticipates learning, i.e., that takes the impact of future learning in uncertain dynamic settings explicitly into account. AntLer expresses system uncertainty using a non-parametric probabilistic model. Given a cost function that measures control performance, AntLer chooses the control parameters such that the expected cost of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.