Ergodic imitation: Learning from what to do and what not to do

Aleksandra Kalinowska; Ahalya Prabhakar; Kathleen Fitzsimons; and Todd; Murphey

arXiv:2103.17098·cs.RO·April 1, 2021

Ergodic imitation: Learning from what to do and what not to do

Aleksandra Kalinowska, Ahalya Prabhakar, Kathleen Fitzsimons, and Todd, Murphey

PDF

TL;DR

This paper introduces a novel learning from demonstration framework that uses ergodic metrics and negative demonstrations to enable robots to learn robust tasks efficiently, even from imperfect or suboptimal examples.

Contribution

It proposes a new algorithmic approach combining ergodic metrics and negative demonstrations for more robust and data-efficient robot learning from demonstrations.

Findings

01

Negative demonstrations can compensate for imperfect positive demonstrations.

02

Combined positive and negative demonstrations improve task learning efficiency.

03

Negative demonstrations alone can successfully teach certain skills.

Abstract

With growing access to versatile robotics, it is beneficial for end users to be able to teach robots tasks without needing to code a control policy. One possibility is to teach the robot through successful task executions. However, near-optimal demonstrations of a task can be difficult to provide and even successful demonstrations can fail to capture task aspects key to robust skill replication. Here, we propose a learning from demonstration (LfD) approach that enables learning of robust task definitions without the need for near-optimal demonstrations. We present a novel algorithmic framework for learning tasks based on the ergodic metric -- a measure of information content in motion. Moreover, we make use of negative demonstrations -- demonstrations of what not to do -- and show that they can help compensate for imperfect demonstrations, reduce the number of demonstrations needed, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.