IRIS: Implicit Reinforcement without Interaction at Scale for Learning   Control from Offline Robot Manipulation Data

Ajay Mandlekar; Fabio Ramos; Byron Boots; Silvio Savarese; Li Fei-Fei,; Animesh Garg; Dieter Fox

arXiv:1911.05321·cs.RO·February 25, 2020

IRIS: Implicit Reinforcement without Interaction at Scale for Learning Control from Offline Robot Manipulation Data

Ajay Mandlekar, Fabio Ramos, Byron Boots, Silvio Savarese, Li Fei-Fei,, Animesh Garg, Dieter Fox

PDF

TL;DR

IRIS is a scalable offline learning framework for robot manipulation that combines goal-conditioned control with goal selection to improve performance on diverse, large datasets.

Contribution

IRIS introduces a novel two-level approach that separates goal setting from low-level control, enabling effective learning from large, diverse, and suboptimal demonstration datasets.

Findings

01

IRIS achieves successful policy learning on large-scale datasets.

02

IRIS outperforms baseline methods on multiple datasets.

03

IRIS effectively handles suboptimal and diverse demonstrations.

Abstract

Learning from offline task demonstrations is a problem of great interest in robotics. For simple short-horizon manipulation tasks with modest variation in task instances, offline learning from a small set of demonstrations can produce controllers that successfully solve the task. However, leveraging a fixed batch of data can be problematic for larger datasets and longer-horizon tasks with greater variations. The data can exhibit substantial diversity and consist of suboptimal solution approaches. In this paper, we propose Implicit Reinforcement without Interaction at Scale (IRIS), a novel framework for learning from large-scale demonstration datasets. IRIS factorizes the control problem into a goal-conditioned low-level controller that imitates short demonstration sequences and a high-level goal selection mechanism that sets goals for the low-level and selectively combines parts of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.