Learning from Successful and Failed Demonstrations via Optimization

Brendan Hertel; S. Reza Ahmadzadeh

arXiv:2107.11918·cs.RO·July 1, 2024·1 cites

Learning from Successful and Failed Demonstrations via Optimization

Brendan Hertel, S. Reza Ahmadzadeh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel Learning from Demonstration method that leverages both successful and failed demonstrations to improve skill learning and reproduction in robotic manipulation tasks.

Contribution

It proposes a new statistical skill model that encodes both demonstration types and finds optimal reproductions balancing success and failure data.

Findings

01

Successfully reproduces skills from failed demonstrations.

02

Outperforms existing LfD approaches in experiments.

03

Effective in multi-coordinate and real-world scenarios.

Abstract

Learning from Demonstration (LfD) is a popular approach that allows humans to teach robots new skills by showing the correct way(s) of performing the desired skill. Human-provided demonstrations, however, are not always optimal and the teacher usually addresses this issue by discarding or replacing sub-optimal (noisy or faulty) demonstrations. We propose a novel LfD representation that learns from both successful and failed demonstrations of a skill. Our approach encodes the two subsets of captured demonstrations (labeled by the teacher) into a statistical skill model, constructs a set of quadratic costs, and finds an optimal reproduction of the skill under novel problem conditions (i.e. constraints). The optimal reproduction balances convergence towards successful examples and divergence from failed examples. We evaluate our approach through several 2D and 3D experiments in real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

brenhertel/TLFSD
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Machine Learning and Algorithms