End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy

Zifan Wang; Xun Yang; Jianzhuang Zhao; Jiaming Zhou; Teli Ma; Ziyao Gao; Arash Ajoudani; Junwei Liang

arXiv:2508.07611·cs.RO·August 12, 2025

End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy

Zifan Wang, Xun Yang, Jianzhuang Zhao, Jiaming Zhou, Teli Ma, Ziyao Gao, Arash Ajoudani, Junwei Liang

PDF

Open Access

TL;DR

This paper presents an end-to-end reinforcement learning approach for humanoid robot navigation that directly processes LiDAR data, ensuring safety and comfort through formal safety constraints and human-centric motion rewards, successfully transferring from simulation to real robots.

Contribution

It introduces a novel safety-aware reinforcement learning framework using Control Barrier Functions within a CMDP, enabling safe, comfortable, and robust humanoid robot navigation in complex environments.

Findings

01

Successful sim-to-real transfer on a humanoid robot

02

Robust navigation around static and dynamic obstacles

03

Enhanced safety and comfort in robot motions

Abstract

The deployment of humanoid robots in unstructured, human-centric environments requires navigation capabilities that extend beyond simple locomotion to include robust perception, provable safety, and socially aware behavior. Current reinforcement learning approaches are often limited by blind controllers that lack environmental awareness or by vision-based systems that fail to perceive complex 3D obstacles. In this work, we present an end-to-end locomotion policy that directly maps raw, spatio-temporal LiDAR point clouds to motor commands, enabling robust navigation in cluttered dynamic scenes. We formulate the control problem as a Constrained Markov Decision Process (CMDP) to formally separate safety from task objectives. Our key contribution is a novel methodology that translates the principles of Control Barrier Functions (CBFs) into costs within the CMDP, allowing a model-free…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Social Robot Interaction and HRI · Robot Manipulation and Learning