Fail-Safe Adversarial Generative Imitation Learning

Philipp Geiger; Christoph-Nikolas Straehle

arXiv:2203.01696·cs.LG·July 31, 2023

Fail-Safe Adversarial Generative Imitation Learning

Philipp Geiger, Christoph-Nikolas Straehle

PDF

Open Access 1 Repo

TL;DR

This paper introduces a safe generative adversarial imitation learning framework with a safety layer that ensures actions are safe, providing theoretical safety guarantees and demonstrating effectiveness on real-world driver data.

Contribution

It presents a novel safety layer with a closed-form density and gradient, enabling safe end-to-end adversarial training with theoretical robustness guarantees.

Findings

01

The safety layer improves robustness during training.

02

The method achieves safe and effective imitation on real-world data.

03

Theoretical analysis confirms linear imitation error growth with horizon.

Abstract

For flexible yet safe imitation learning (IL), we propose theory and a modular method, with a safety layer that enables a closed-form probability density/gradient of the safe generative continuous policy, end-to-end generative adversarial training, and worst-case safety guarantees. The safety layer maps all actions into a set of safe actions, and uses the change-of-variables formula plus additivity of measures for the density. The set of safe actions is inferred by first checking safety of a finite sample of actions via adversarial reachability analysis of fallback maneuvers, and then concluding on the safety of these actions' neighborhoods using, e.g., Lipschitz continuity. We provide theoretical analysis showing the robustness advantage of using the safety layer already during training (imitation error linear in the horizon) compared to only using it at test time (up to quadratic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boschresearch/fagil
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Model Reduction and Neural Networks