Stochastic Action Prediction for Imitation Learning

Sagar Gubbi Venkatesh; Nihesh Rathod; Shishir Kolathaya and; Bharadwaj Amrutur

arXiv:2101.01055·cs.LG·January 5, 2021

Stochastic Action Prediction for Imitation Learning

Sagar Gubbi Venkatesh, Nihesh Rathod, Shishir Kolathaya and, Bharadwaj Amrutur

PDF

Open Access

TL;DR

This paper investigates the inherent randomness in expert demonstrations for imitation learning and shows that modeling this stochasticity improves task success rates.

Contribution

It introduces methods to model stochasticity in demonstration data using autoregressive, GAN, and variational approaches, demonstrating their effectiveness.

Findings

01

Modeling stochasticity improves imitation learning success rates.

02

Autoregressive, GAN, and variational methods effectively capture data variability.

03

Accounting for stochasticity leads to substantial performance gains.

Abstract

Imitation learning is a data-driven approach to acquiring skills that relies on expert demonstrations to learn a policy that maps observations to actions. When performing demonstrations, experts are not always consistent and might accomplish the same task in slightly different ways. In this paper, we demonstrate inherent stochasticity in demonstrations collected for tasks including line following with a remote-controlled car and manipulation tasks including reaching, pushing, and picking and placing an object. We model stochasticity in the data distribution using autoregressive action generation, generative adversarial nets, and variational prediction and compare the performance of these approaches. We find that accounting for stochasticity in the expert data leads to substantial improvement in the success rate of task completion.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis