Constrained Skill Discovery: Quadruped Locomotion with Unsupervised   Reinforcement Learning

Vassil Atanassov; Wanming Yu; Alexander Luis Mitchell; Mark Nicholas; Finean; Ioannis Havoutis

arXiv:2410.07877·cs.RO·October 11, 2024

Constrained Skill Discovery: Quadruped Locomotion with Unsupervised Reinforcement Learning

Vassil Atanassov, Wanming Yu, Alexander Luis Mitchell, Mark Nicholas, Finean, Ioannis Havoutis

PDF

Open Access

TL;DR

This paper introduces a new unsupervised reinforcement learning method for quadruped robots that enhances skill diversity and stability, enabling accurate zero-shot navigation without task-specific rewards.

Contribution

The authors propose a novel constrained skill discovery approach using a norm-matching objective, improving state coverage and locomotion stability over previous methods.

Findings

01

Richer state space coverage achieved

02

More stable and controllable locomotion behaviors learned

03

Successful zero-shot navigation on real robot

Abstract

Representation learning and unsupervised skill discovery can allow robots to acquire diverse and reusable behaviors without the need for task-specific rewards. In this work, we use unsupervised reinforcement learning to learn a latent representation by maximizing the mutual information between skills and states subject to a distance constraint. Our method improves upon prior constrained skill discovery methods by replacing the latent transition maximization with a norm-matching objective. This not only results in a much a richer state space coverage compared to baseline methods, but allows the robot to learn more stable and easily controllable locomotive behaviors. We successfully deploy the learned policy on a real ANYmal quadruped robot and demonstrate that the robot can accurately reach arbitrary points of the Cartesian state space in a zero-shot manner, using only an intrinsic skill…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Path Planning Algorithms · Hand Gesture Recognition Systems