Chasing Autonomy: Dynamic Retargeting and Control Guided RL for Performant and Controllable Humanoid Running

Zachary Olkin; William D. Compton; Ryan M. Bena; Aaron D. Ames

arXiv:2603.25902·cs.RO·March 30, 2026

Chasing Autonomy: Dynamic Retargeting and Control Guided RL for Performant and Controllable Humanoid Running

Zachary Olkin, William D. Compton, Ryan M. Bena, Aaron D. Ames

PDF

TL;DR

This paper introduces a reinforcement learning-based control pipeline for humanoid robots that enables dynamic, controllable, and long-duration running by retargeting human motions and optimizing reward structures, demonstrated on hardware.

Contribution

It presents a novel pipeline for retargeting human motions and optimizing rewards to improve humanoid running performance and controllability in real-world environments.

Findings

01

Achieved running speeds up to 3.3 m/s on hardware.

02

Demonstrated hundreds of meters of autonomous outdoor running.

03

Control-guided reward improves velocity tracking performance.

Abstract

Humanoid robots have the promise of locomoting like humans, including fast and dynamic running. Recently, reinforcement learning (RL) controllers that can mimic human motions have become popular as they can generate very dynamic behaviors, but they are often restricted to single motion play-back which hinders their deployment in long duration and autonomous locomotion. In this paper, we present a pipeline to dynamically retarget human motions through an optimization routine with hard constraints to generate improved periodic reference libraries from a single human demonstration. We then study the effect of both the reference motion and the reward structure on the reference and commanded velocity tracking, concluding that a goal-conditioned and control-guided reward which tracks dynamically optimized human data results in the best performance. We deploy the policy on hardware,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.