Divide & Conquer Imitation Learning
Alexandre Chenu, Nicolas Perrin-Gilbert, Olivier Sigaud

TL;DR
This paper introduces a divide and conquer imitation learning algorithm that efficiently learns complex robotic tasks from a single expert demonstration by decomposing tasks into smaller skills.
Contribution
The novel algorithm leverages a sequential inductive bias to break down complex tasks into manageable skills for goal-conditioned policy learning.
Findings
Successfully imitates a non-holonomic navigation task
Scales to complex robotic manipulation with high sample efficiency
Operates effectively with only a single expert demonstration
Abstract
When cast into the Deep Reinforcement Learning framework, many robotics tasks require solving a long horizon and sparse reward problem, where learning algorithms struggle. In such context, Imitation Learning (IL) can be a powerful approach to bootstrap the learning process. However, most IL methods require several expert demonstrations which can be prohibitively difficult to acquire. Only a handful of IL algorithms have shown efficiency in the context of an extreme low expert data regime where a single expert demonstration is available. In this paper, we present a novel algorithm designed to imitate complex robotic tasks from the states of an expert trajectory. Based on a sequential inductive bias, our method divides the complex task into smaller skills. The skills are learned into a goal-conditioned policy that is able to solve each skill individually and chain skills to solve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Mobile Crowdsensing and Crowdsourcing
