Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim, Kyowoon Lee, Jaesik Choi

TL;DR
This paper introduces a novel curriculum-based reinforcement learning method called VUVC that enhances unsupervised skill discovery, improves sample efficiency, and accelerates state coverage in complex navigation and robotic tasks.
Contribution
It recasts variational empowerment as curriculum learning and proposes VUVC, a new approach that accelerates state entropy increase and improves skill discovery efficiency.
Findings
VUVC accelerates entropy increase in visited states.
Skills learned enable zero-shot robot navigation.
Method improves sample efficiency and state coverage.
Abstract
Mutual information-based reinforcement learning (RL) has been proposed as a promising framework for retrieving complex skills autonomously without a task-oriented reward function through mutual information (MI) maximization or variational empowerment. However, learning complex skills is still challenging, due to the fact that the order of training skills can largely affect sample efficiency. Inspired by this, we recast variational empowerment as curriculum learning in goal-conditioned RL with an intrinsic reward function, which we name Variational Curriculum RL (VCRL). From this perspective, we propose a novel approach to unsupervised skill discovery based on information theory, called Value Uncertainty Variational Curriculum (VUVC). We prove that, under regularity conditions, VUVC accelerates the increase of entropy in the visited states compared to the uniform curriculum. We validate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Software Engineering Research
