On the Benefits of Inducing Local Lipschitzness for Robust Generative   Adversarial Imitation Learning

Farzan Memarian; Abolfazl Hashemi; Scott Niekum; Ufuk Topcu

arXiv:2107.00116·cs.LG·January 17, 2024

On the Benefits of Inducing Local Lipschitzness for Robust Generative Adversarial Imitation Learning

Farzan Memarian, Abolfazl Hashemi, Scott Niekum, Ufuk Topcu

PDF

Open Access

TL;DR

This paper introduces a regularization technique to induce local Lipschitzness in GAIL's discriminator and generator, significantly enhancing policy robustness against observation noise in robotics tasks.

Contribution

The paper proposes a novel regularization method to enforce local Lipschitzness in GAIL, improving robustness of learned policies against noisy observations.

Findings

01

Robust policies outperform state-of-the-art GAIL in noisy environments

02

Training a Lipschitz discriminator induces a Lipschitz generator

03

Method demonstrates significant robustness improvements in MuJoCo environments

Abstract

We explore methodologies to improve the robustness of generative adversarial imitation learning (GAIL) algorithms to observation noise. Towards this objective, we study the effect of local Lipschitzness of the discriminator and the generator on the robustness of policies learned by GAIL. In many robotics applications, the learned policies by GAIL typically suffer from a degraded performance at test time since the observations from the environment might be corrupted by noise. Hence, robustifying the learned policies against the observation noise is of critical importance. To this end, we propose a regularization method to induce local Lipschitzness in the generator and the discriminator of adversarial imitation learning methods. We show that the modified objective leads to learning significantly more robust policies. Moreover, we demonstrate -- both theoretically and experimentally --…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Robot Manipulation and Learning · Anomaly Detection Techniques and Applications

MethodsGenerative Adversarial Imitation Learning