Achieving Goals using Reward Shaping and Curriculum Learning

Mihai Anca; Jonathan D. Thomas; Dabal Pedamonti; Matthew Studley; Mark; Hansen

arXiv:2206.02462·cs.RO·April 21, 2023·1 cites

Achieving Goals using Reward Shaping and Curriculum Learning

Mihai Anca, Jonathan D. Thomas, Dabal Pedamonti, Matthew Studley, Mark, Hansen

PDF

Open Access

TL;DR

This paper presents a method combining curriculum learning, reward shaping, and parallel environments to efficiently train agents for complex goal-oriented robotic tasks, specifically stacking cubes, by breaking down the task into manageable sub-goals.

Contribution

It introduces two curriculum learning settings for goal-conditioned tasks, enabling the learning of complex tasks without extensive architectural changes.

Findings

01

Effective task decomposition into sub-goals

02

Successful application to cube stacking with varied object shapes

03

Enhanced learning efficiency through parallel environments

Abstract

Real-time control for robotics is a popular research area in the reinforcement learning community. Through the use of techniques such as reward shaping, researchers have managed to train online agents across a multitude of domains. Despite these advances, solving goal-oriented tasks still requires complex architectural changes or hard constraints to be placed on the problem. In this article, we solve the problem of stacking multiple cubes by combining curriculum learning, reward shaping, and a high number of efficiently parallelized environments. We introduce two curriculum learning settings that allow us to separate the complex task into sequential sub-goals, hence enabling the learning of a problem that may otherwise be too difficult. We focus on discussing the challenges encountered while implementing them in a goal-conditioned environment. Finally, we extend the best configuration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics