TL;DR
This paper presents a system enabling robots to learn multi-step tasks from a single demonstration by localizing actions in auxiliary videos, learning reward functions, and applying reinforcement learning for policy acquisition.
Contribution
A novel approach that leverages auxiliary videos and meta-learning to enable one-shot learning of complex multi-step tasks in robots.
Findings
Robots learn multi-step tasks more effectively with auxiliary video data.
Localization of individual actions improves learning performance.
Performance significantly improves over unsegmented video learning.
Abstract
Due to burdensome data requirements, learning from demonstration often falls short of its promise to allow users to quickly and naturally program robots. Demonstrations are inherently ambiguous and incomplete, making correct generalization to unseen situations difficult without a large number of demonstrations in varying conditions. By contrast, humans are often able to learn complex tasks from a single demonstration (typically observations without action labels) by leveraging context learned over a lifetime. Inspired by this capability, our goal is to enable robots to perform one-shot learning of multi-step tasks from observation by leveraging auxiliary video data as context. Our primary contribution is a novel system that achieves this goal by: (1) using a single user-segmented demonstration to define the primitive actions that comprise a task, (2) localizing additional examples of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
