Learning from Demonstration with Weakly Supervised Disentanglement

Yordan Hristov; Subramanian Ramamoorthy

arXiv:2006.09107·cs.RO·March 29, 2021·1 cites

Learning from Demonstration with Weakly Supervised Disentanglement

Yordan Hristov, Subramanian Ramamoorthy

PDF

Open Access 1 Video

TL;DR

This paper introduces a weakly supervised learning approach for robotic manipulation from demonstrations, aligning high-level concepts with high-dimensional sensory data using user-provided labels, improving interpretability and control.

Contribution

It proposes a probabilistic generative model with explicit alignment of latent variables to high-level concepts using weak labels, enhancing learning from demonstrations in robotics.

Findings

01

Effective in two manipulation tasks with PR2 robot

02

Improves interpretability of learned representations

03

Utilizes weak supervision to align high-level concepts

Abstract

Robotic manipulation tasks, such as wiping with a soft sponge, require control from multiple rich sensory modalities. Human-robot interaction, aimed at teaching robots, is difficult in this setting as there is potential for mismatch between human and machine comprehension of the rich data streams. We treat the task of interpretable learning from demonstration as an optimisation problem over a probabilistic generative model. To account for the high-dimensionality of the data, a high-capacity neural network is chosen to represent the model. The latent variables in this model are explicitly aligned with high-level notions and concepts that are manifested in a set of demonstrations. We show that such alignment is best achieved through the use of labels from the end user, in an appropriately restricted vocabulary, in contrast to the conventional approach of the designer picking a prior over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning from Demonstration with Weakly Supervised Disentanglement· slideslive

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning