Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level   Stability and High-Level Behavior

Adam Block; Ali Jadbabaie; Daniel Pfrommer; Max Simchowitz; Russ; Tedrake

arXiv:2307.14619·cs.LG·October 25, 2023

Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior

Adam Block, Ali Jadbabaie, Daniel Pfrommer, Max Simchowitz, Russ, Tedrake

PDF

Open Access

TL;DR

This paper introduces a theoretical framework for behavior cloning using generative models, emphasizing low-level stability and a novel noise augmentation technique to closely imitate expert trajectories in complex tasks.

Contribution

It provides a new theoretical analysis linking low-level controller stability with high-level imitation quality, and proposes a noise augmentation method to ensure trajectory distribution closeness.

Findings

01

The framework guarantees imitation quality under stability assumptions.

02

Adding augmentation noise improves imitation accuracy with minimal degradation.

03

Empirical validation supports the proposed theoretical insights.

Abstract

We propose a theoretical framework for studying behavior cloning of complex expert demonstrations using generative modeling. Our framework invokes low-level controllers - either learned or implicit in position-command control - to stabilize imitation around expert demonstrations. We show that with (a) a suitable low-level stability guarantee and (b) a powerful enough generative model as our imitation learner, pure supervised behavior cloning can generate trajectories matching the per-time step distribution of essentially arbitrary expert trajectories in an optimal transport cost. Our analysis relies on a stochastic continuity property of the learned policy we call "total variation continuity" (TVC). We then show that TVC can be ensured with minimal degradation of accuracy by combining a popular data-augmentation regimen with a novel algorithmic trick: adding augmentation noise at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Music Technology and Sound Studies · Model Reduction and Neural Networks

MethodsDiffusion