Learning from Demonstration with Weakly Supervised Disentanglement
Yordan Hristov, Subramanian Ramamoorthy

TL;DR
This paper introduces a weakly supervised learning approach for robotic manipulation from demonstrations, aligning high-level concepts with high-dimensional sensory data using user-provided labels, improving interpretability and control.
Contribution
It proposes a probabilistic generative model with explicit alignment of latent variables to high-level concepts using weak labels, enhancing learning from demonstrations in robotics.
Findings
Effective in two manipulation tasks with PR2 robot
Improves interpretability of learned representations
Utilizes weak supervision to align high-level concepts
Abstract
Robotic manipulation tasks, such as wiping with a soft sponge, require control from multiple rich sensory modalities. Human-robot interaction, aimed at teaching robots, is difficult in this setting as there is potential for mismatch between human and machine comprehension of the rich data streams. We treat the task of interpretable learning from demonstration as an optimisation problem over a probabilistic generative model. To account for the high-dimensionality of the data, a high-capacity neural network is chosen to represent the model. The latent variables in this model are explicitly aligned with high-level notions and concepts that are manifested in a set of demonstrations. We show that such alignment is best achieved through the use of labels from the end user, in an appropriately restricted vocabulary, in contrast to the conventional approach of the designer picking a prior over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning
