Learning Lipschitz Feedback Policies from Expert Demonstrations:   Closed-Loop Guarantees, Generalization and Robustness

Abed AlRahman Al Makdah; Vishaal Krishnan; Fabio Pasqualetti

arXiv:2103.16629·cs.LG·November 3, 2022

Learning Lipschitz Feedback Policies from Expert Demonstrations: Closed-Loop Guarantees, Generalization and Robustness

Abed AlRahman Al Makdah, Vishaal Krishnan, Fabio Pasqualetti

PDF

Open Access

TL;DR

This paper introduces a method to learn feedback control policies from expert demonstrations that guarantees robustness and generalization in closed-loop systems using Lipschitz constraints, with theoretical bounds and empirical validation.

Contribution

It presents a novel Lipschitz-constrained learning framework that provides certified robustness and generalization guarantees for feedback policies learned from demonstrations.

Findings

01

Finite sample bounds on policy learning error

02

Guaranteed closed-loop stability under learned policies

03

Tradeoff between nominal performance and adversarial robustness

Abstract

In this work, we propose a framework to learn feedback control policies with guarantees on closed-loop generalization and adversarial robustness. These policies are learned directly from expert demonstrations, contained in a dataset of state-control input pairs, without any prior knowledge of the task and system model. We use a Lipschitz-constrained loss minimization scheme to learn feedback policies with certified closed-loop robustness, wherein the Lipschitz constraint serves as a mechanism to tune the generalization performance and robustness to adversarial disturbances. Our analysis exploits the Lipschitz property to obtain closed-loop guarantees on generalization and robustness of the learned policies. In particular, we derive a finite sample bound on the policy learning error and establish robust closed-loop stability under the learned control policy. We also derive bounds on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Model Reduction and Neural Networks