Bayesian Disturbance Injection: Robust Imitation Learning of Flexible   Policies

Hanbit Oh; Hikaru Sasaki; Brendan Michael; Takamitsu Matsubara

arXiv:2103.13623·cs.RO·November 8, 2022

Bayesian Disturbance Injection: Robust Imitation Learning of Flexible Policies

Hanbit Oh, Hikaru Sasaki, Brendan Michael, Takamitsu Matsubara

PDF

Open Access

TL;DR

This paper introduces a Bayesian variational inference framework for imitation learning that learns flexible multi-action policies and enhances robustness by incorporating disturbances, improving generalization and safety in robotic tasks.

Contribution

It presents the first Bayesian non-parametric imitation learning method that combines policy flexibility with robustness through disturbance injection.

Findings

01

Improved policy flexibility and robustness in simulations and real robot experiments.

02

Enhanced learning performance and control safety over comparison methods.

03

Effective handling of multi-action decision scenarios in robotic manipulation.

Abstract

Scenarios requiring humans to choose from multiple seemingly optimal actions are commonplace, however standard imitation learning often fails to capture this behavior. Instead, an over-reliance on replicating expert actions induces inflexible and unstable policies, leading to poor generalizability in an application. To address the problem, this paper presents the first imitation learning framework that incorporates Bayesian variational inference for learning flexible non-parametric multi-action policies, while simultaneously robustifying the policies against sources of error, by introducing and optimizing disturbances to create a richer demonstration dataset. This combinatorial approach forces the policy to adapt to challenging situations, enabling stable multi-action policies to be learned efficiently. The effectiveness of our proposed method is evaluated through simulations and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Machine Learning and Algorithms