Variational Curriculum Reinforcement Learning for Unsupervised Discovery   of Skills

Seongun Kim; Kyowoon Lee; Jaesik Choi

arXiv:2310.19424·cs.LG·October 31, 2023·1 cites

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

Seongun Kim, Kyowoon Lee, Jaesik Choi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel curriculum-based reinforcement learning method called VUVC that enhances unsupervised skill discovery, improves sample efficiency, and accelerates state coverage in complex navigation and robotic tasks.

Contribution

It recasts variational empowerment as curriculum learning and proposes VUVC, a new approach that accelerates state entropy increase and improves skill discovery efficiency.

Findings

01

VUVC accelerates entropy increase in visited states.

02

Skills learned enable zero-shot robot navigation.

03

Method improves sample efficiency and state coverage.

Abstract

Mutual information-based reinforcement learning (RL) has been proposed as a promising framework for retrieving complex skills autonomously without a task-oriented reward function through mutual information (MI) maximization or variational empowerment. However, learning complex skills is still challenging, due to the fact that the order of training skills can largely affect sample efficiency. Inspired by this, we recast variational empowerment as curriculum learning in goal-conditioned RL with an intrinsic reward function, which we name Variational Curriculum RL (VCRL). From this perspective, we propose a novel approach to unsupervised skill discovery based on information theory, called Value Uncertainty Variational Curriculum (VUVC). We prove that, under regularity conditions, VUVC accelerates the increase of entropy in the visited states compared to the uniform curriculum. We validate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

seongun-kim/vcrl
pytorchOfficial

Videos

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills· slideslive

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Software Engineering Research