Reinforcement Learning with Data Bootstrapping for Dynamic Subgoal Pursuit in Humanoid Robot Navigation

Chengyang Peng; Zhihao Zhang; Shiting Gong; Sankalp Agrawal; Keith A. Redmill; and Ayonga Hereid

arXiv:2506.02206·cs.RO·June 4, 2025

Reinforcement Learning with Data Bootstrapping for Dynamic Subgoal Pursuit in Humanoid Robot Navigation

Chengyang Peng, Zhihao Zhang, Shiting Gong, Sankalp Agrawal, Keith A. Redmill, and Ayonga Hereid

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning framework with data bootstrapping for efficient, safe, and adaptable humanoid robot navigation in cluttered environments, combining high-level subgoal planning with low-level gait control.

Contribution

It presents a novel hierarchical approach integrating RL and MPC for dynamic subgoal pursuit, enhanced by data bootstrapping to improve training stability and efficiency.

Findings

01

Significantly higher navigation success rates in simulation.

02

Enhanced adaptability to diverse obstacle scenarios.

03

Faster training convergence with data bootstrapping.

Abstract

Safe and real-time navigation is fundamental for humanoid robot applications. However, existing bipedal robot navigation frameworks often struggle to balance computational efficiency with the precision required for stable locomotion. We propose a novel hierarchical framework that continuously generates dynamic subgoals to guide the robot through cluttered environments. Our method comprises a high-level reinforcement learning (RL) planner for subgoal selection in a robot-centric coordinate system and a low-level Model Predictive Control (MPC) based planner which produces robust walking gaits to reach these subgoals. To expedite and stabilize the training process, we incorporate a data bootstrapping technique that leverages a model-based navigation approach to generate a diverse, informative dataset. We validate our method in simulation using the Agility Robotics Digit humanoid across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Robotic Locomotion and Control