Learning Lipschitz Feedback Policies from Expert Demonstrations: Closed-Loop Guarantees, Generalization and Robustness
Abed AlRahman Al Makdah, Vishaal Krishnan, Fabio Pasqualetti

TL;DR
This paper introduces a method to learn feedback control policies from expert demonstrations that guarantees robustness and generalization in closed-loop systems using Lipschitz constraints, with theoretical bounds and empirical validation.
Contribution
It presents a novel Lipschitz-constrained learning framework that provides certified robustness and generalization guarantees for feedback policies learned from demonstrations.
Findings
Finite sample bounds on policy learning error
Guaranteed closed-loop stability under learned policies
Tradeoff between nominal performance and adversarial robustness
Abstract
In this work, we propose a framework to learn feedback control policies with guarantees on closed-loop generalization and adversarial robustness. These policies are learned directly from expert demonstrations, contained in a dataset of state-control input pairs, without any prior knowledge of the task and system model. We use a Lipschitz-constrained loss minimization scheme to learn feedback policies with certified closed-loop robustness, wherein the Lipschitz constraint serves as a mechanism to tune the generalization performance and robustness to adversarial disturbances. Our analysis exploits the Lipschitz property to obtain closed-loop guarantees on generalization and robustness of the learned policies. In particular, we derive a finite sample bound on the policy learning error and establish robust closed-loop stability under the learned control policy. We also derive bounds on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Model Reduction and Neural Networks
