Episodic Gaussian Process-Based Learning Control with Vanishing Tracking Errors
Armin Lederer, Jonas Umlauft, Sandra Hirche

TL;DR
This paper develops a novel episodic learning control method using Gaussian processes, providing guarantees of vanishing tracking errors as more data is accumulated, even for complex systems without accurate first-principles models.
Contribution
It introduces a Bayesian prediction error bound based on data density, enabling time-varying tracking guarantees and an episodic learning approach for control systems.
Findings
Prediction error bound decays with data density.
Vanishing tracking errors achieved with increasing data.
Episodic learning guarantees arbitrary accuracy.
Abstract
Due to the increasing complexity of technical systems, accurate first principle models can often not be obtained. Supervised machine learning can mitigate this issue by inferring models from measurement data. Gaussian process regression is particularly well suited for this purpose due to its high data-efficiency and its explicit uncertainty representation, which allows the derivation of prediction error bounds. These error bounds have been exploited to show tracking accuracy guarantees for a variety of control approaches, but their direct dependency on the training data is generally unclear. We address this issue by deriving a Bayesian prediction error bound for GP regression, which we show to decay with the growth of a novel, kernel-based measure of data density. Based on the prediction error bound, we prove time-varying tracking accuracy guarantees for learned GP models used as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Control Systems Optimization · Control Systems and Identification
MethodsGaussian Process
