Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane; Cordelia Schmid; Ivan Laptev

arXiv:2107.00541·cs.LG·July 2, 2021·26 cites

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

PDF

Open Access 1 Video

TL;DR

This paper introduces a goal-conditioned reinforcement learning method that uses imagined subgoals predicted by a high-level policy to improve learning of complex, temporally extended tasks, especially in robotics.

Contribution

It proposes a novel approach combining imagined subgoals with KL-constrained policy iteration to enhance learning efficiency and performance in complex tasks.

Findings

01

Outperforms existing methods on robotic navigation tasks

02

Improves learning speed and stability

03

Effectively handles temporally extended reasoning

Abstract

Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it often struggles to solve tasks that require more temporally extended reasoning. In this work, we propose to incorporate imagined subgoals into policy learning to facilitate learning of complex tasks. Imagined subgoals are predicted by a separate high-level policy, which is trained simultaneously with the policy and its critic. This high-level policy predicts intermediate states halfway to the goal using the value function as a reachability metric. We don't require the policy to reach these subgoals explicitly. Instead, we use them to define a prior policy, and incorporate this prior into a KL-constrained policy iteration scheme to speed up and regularize learning. Imagined subgoals are used during policy learning, but not during test time, where we only apply the learned policy. We evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Goal-Conditioned Reinforcement Learning with Imagined Subgoals· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Explainable Artificial Intelligence (XAI)