Learning Velocity-based Humanoid Locomotion: Massively Parallel Learning with Brax and MJX

William Thibault; William Melek; Katja Mombaur

arXiv:2407.05148·cs.RO·July 9, 2024

TL;DR

This paper presents a velocity-based reinforcement learning locomotion policy for the REEM-C humanoid robot, trained with parallel simulation in Brax and MJX. Results are shown in simulation, with real-robot experiments in progress.

Contribution

It presents a velocity-based RL locomotion policy for the REEM-C humanoid robot that uses a periodic reward formulation and is implemented in Brax/MJX for fast parallel training, with real-world experiments still in progress.

Findings

01 Fast training achieved with Brax/MJX simulation

02 Policy demonstrates effective velocity-based locomotion

03 Simulation results show promising performance

Abstract

Humanoid locomotion is a key skill for bringing humanoids out of the lab and into the real world. Many motion generation methods for locomotion have been proposed, including reinforcement learning (RL). RL locomotion policies offer great versatility and generalizability, along with the ability to improve over time as they gather new experience. This work presents a velocity-based RL locomotion policy for the REEM-C robot. The policy uses a periodic reward formulation and is implemented in Brax/MJX for fast training. Simulation results for the policy are demonstrated, with experimental results in progress.
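The abstract names two ingredients, a velocity-tracking objective and a periodic reward, without giving their exact form. The JAX sketch below is one plausible reading, not the authors' formulation: the exponential tracking kernel, the duty factor, the half-cycle leg offset, the weights, and every function name are assumptions made for illustration. It also does not call the Brax or MJX APIs; it only shows how a per-environment reward can be batched with `jax.vmap`, which is the mechanism that makes massively parallel training possible.

```python
import jax
import jax.numpy as jnp


def velocity_tracking_reward(base_vel_xy, cmd_vel_xy, sigma=0.25):
    """Exponential kernel on the squared planar velocity error (assumed form)."""
    err = jnp.sum((base_vel_xy - cmd_vel_xy) ** 2)
    return jnp.exp(-err / sigma)


def expected_stance(phase, offset, duty=0.6):
    """1.0 while a foot is expected on the ground, 0.0 during swing.

    phase is a gait clock in [0, 1); duty is the assumed stance fraction.
    """
    return jnp.where(jnp.mod(phase + offset, 1.0) < duty, 1.0, 0.0)


def periodic_contact_reward(phase, contacts):
    """Fraction of feet whose contact state matches the gait clock.

    contacts has shape (2,): 1.0 if the left/right foot touches the ground.
    """
    offsets = jnp.array([0.0, 0.5])  # assumed: legs half a cycle apart
    expected = expected_stance(phase, offsets)
    return jnp.mean(jnp.where(expected == contacts, 1.0, 0.0))


def step_reward(base_vel_xy, cmd_vel_xy, phase, contacts, w_vel=1.0, w_gait=0.5):
    """Weighted sum of the two terms for a single environment."""
    return (w_vel * velocity_tracking_reward(base_vel_xy, cmd_vel_xy)
            + w_gait * periodic_contact_reward(phase, contacts))


# Map the single-environment reward over a leading batch axis and compile it.
batched_reward = jax.jit(jax.vmap(step_reward))

num_envs = 4096  # hypothetical number of parallel environments
k1, k2, k3 = jax.random.split(jax.random.PRNGKey(0), 3)
cmd = jax.random.uniform(k1, (num_envs, 2), minval=-1.0, maxval=1.0)
vel = cmd + 0.1 * jax.random.normal(k2, (num_envs, 2))  # noisy tracking
phase = jax.random.uniform(k3, (num_envs,))
contacts = jnp.ones((num_envs, 2))  # both feet down in every environment

rewards = batched_reward(vel, cmd, phase, contacts)
print(rewards.shape)  # (4096,)
```

Because the reward is written for one environment and batched afterwards, the same function could in principle be reused unchanged whether 1 or several thousand environments are simulated; only the leading array dimension changes.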

Taxonomy

Topics: Human Pose and Action Recognition · Robotic Locomotion and Control · Hand Gesture Recognition Systems