Curriculum Learning with Hindsight Experience Replay for Sequential Object Manipulation Tasks

Binyamin Manela; Armin Biess

arXiv:2008.09377·cs.LG·August 24, 2020

TL;DR

This paper introduces an algorithm that combines curriculum learning with Hindsight Experience Replay (HER) to learn complex sequential object manipulation tasks under sparse rewards, and reports significant improvements over standard HER.

Contribution

The study presents a new algorithm that integrates curriculum learning with HER, exploiting task structure for improved learning efficiency in complex manipulation tasks.

Findings

1. Vast improvements over vanilla HER in three throwing tasks
2. Effective learning with sparse feedback and multiple goals
3. Exploits the recurrent structure of manipulation tasks without adjusting the simulation for each source task

Abstract

Learning complex tasks from scratch is challenging and often impossible for humans as well as for artificial agents. A curriculum can be used instead, which decomposes a complex task (target task) into a sequence of source tasks (the curriculum). Each source task is a simplified version of the next source task with increasing complexity. Learning then occurs gradually by training on each source task while using knowledge from the curriculum's prior source tasks. In this study, we present a new algorithm that combines curriculum learning with Hindsight Experience Replay (HER), to learn sequential object manipulation tasks for multiple goals and sparse feedback. The algorithm exploits the recurrent structure inherent in many object manipulation tasks and implements the entire learning process in the original simulation without adjusting it to each source task. We have tested our algorithm…
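The two ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it shows generic hindsight relabeling (using the goal achieved at the end of an episode, with the common 0 / -1 sparse reward convention) and a plain threshold-based progression through source tasks. The function names, the transition dictionary layout, the tolerance, and the success threshold are all assumptions made for illustration.

```python
import numpy as np


def sparse_reward(achieved, goal, tol=0.05):
    """Sparse feedback (assumed convention): 0 on success, -1 otherwise."""
    dist = np.linalg.norm(np.asarray(achieved, dtype=float) - np.asarray(goal, dtype=float))
    return 0.0 if dist <= tol else -1.0


def her_relabel_final(episode, tol=0.05):
    """Return a copy of the episode relabeled with the finally achieved goal.

    `episode` is a list of dicts with (hypothetical) keys:
    obs, action, achieved_next, goal.
    """
    new_goal = episode[-1]["achieved_next"]
    relabeled = []
    for transition in episode:
        reward = sparse_reward(transition["achieved_next"], new_goal, tol)
        # Build a new dict so the original transition is left untouched.
        relabeled.append({**transition, "goal": new_goal, "reward": reward})
    return relabeled


def run_curriculum(source_tasks, train_one_round, threshold=0.9, max_rounds=100):
    """Train on each source task in order.

    `train_one_round(task)` is a user-supplied callable (hypothetical) that
    trains for one round and returns the current success rate in [0, 1].
    The loop advances to the next task once the rate reaches `threshold`
    or after `max_rounds` rounds. Returns the rounds spent per task.
    """
    rounds_used = []
    for task in source_tasks:
        n = 0
        for n in range(1, max_rounds + 1):
            if train_one_round(task) >= threshold:
                break
        rounds_used.append(n)
    return rounds_used
```

In a full agent, the relabeled transitions would be stored in the replay buffer alongside the originals, and `train_one_round` would wrap an off-policy learner; both parts are left out here because the summary above does not specify them.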

Taxonomy

Methods: Experience Replay