Automatic Goal Generation using Dynamical Distance Learning

Bharat Prakash; Nicholas Waytowich; Tinoosh Mohsenin; Tim Oates

arXiv:2111.04120·cs.AI·November 9, 2021

Automatic Goal Generation using Dynamical Distance Learning

Bharat Prakash, Nicholas Waytowich, Tinoosh Mohsenin, Tim Oates

PDF

Open Access

TL;DR

This paper introduces a method for automatic goal generation in reinforcement learning using a dynamical distance function, creating curricula that improve sample efficiency in complex tasks.

Contribution

The paper proposes a novel self-supervised approach for goal curriculum generation using a dynamical distance function in multi-goal RL.

Findings

01

Improved sample efficiency over baseline methods

02

Effective curriculum generation for robotic tasks

03

Enhanced learning speed in goal-conditioned environments

Abstract

Reinforcement Learning (RL) agents can learn to solve complex sequential decision making tasks by interacting with the environment. However, sample efficiency remains a major challenge. In the field of multi-goal RL, where agents are required to reach multiple goals to solve complex tasks, improving sample efficiency can be especially challenging. On the other hand, humans or other biological agents learn such tasks in a much more strategic way, following a curriculum where tasks are sampled with increasing difficulty level in order to make gradual and efficient learning progress. In this work, we propose a method for automatic goal generation using a dynamical distance function (DDF) in a self-supervised fashion. DDF is a function which predicts the dynamical distance between any two states within a markov decision process (MDP). With this, we generate a curriculum of goals at the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms