Learning Humanoid Locomotion with World Model Reconstruction

Wandong Sun; Long Chen; Yongbo Su; Baoshi Cao; Yang Liu; Zongwu Xie

arXiv:2502.16230·cs.RO·February 25, 2025

Learning Humanoid Locomotion with World Model Reconstruction

Wandong Sun, Long Chen, Yongbo Su, Baoshi Cao, Yang Liu, Zongwu Xie

PDF

Open Access

TL;DR

This paper introduces World Model Reconstruction (WMR), an end-to-end learning approach enabling humanoid robots to navigate complex terrains blindly by reconstructing the environment and improving locomotion robustness.

Contribution

The study presents a novel joint training method for world state reconstruction and locomotion policy, enhancing real-world terrain navigation without relying on sensor data.

Findings

01

Successfully navigated 3.2 km across icy and snowy terrains

02

Demonstrated robustness on deformable and slippery surfaces

03

Achieved autonomous long-distance hiking in real-world conditions

Abstract

Humanoid robots are designed to navigate environments accessible to humans using their legs. However, classical research has primarily focused on controlled laboratory settings, resulting in a gap in developing controllers for navigating complex real-world terrains. This challenge mainly arises from the limitations and noise in sensor data, which hinder the robot's understanding of itself and the environment. In this study, we introduce World Model Reconstruction (WMR), an end-to-end learning-based approach for blind humanoid locomotion across challenging terrains. We propose training an estimator to explicitly reconstruct the world state and utilize it to enhance the locomotion policy. The locomotion policy takes inputs entirely from the reconstructed information. The policy and the estimator are trained jointly; however, the gradient between them is intentionally cut off. This ensures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Robotic Locomotion and Control