HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

Zhanxiang Cao; Liyun Yan; Yang Zhang; Sirui Chen; Jianming Ma; Tianyue Zhan; Shengcheng Fu; Yufei Jia; Cewu Lu; Yue Gao

arXiv:2602.06341·cs.RO·February 9, 2026

HiWET: Hierarchical World-Frame End-Effector Tracking for Long-Horizon Humanoid Loco-Manipulation

Zhanxiang Cao, Liyun Yan, Yang Zhang, Sirui Chen, Jianming Ma, Tianyue Zhan, Shengcheng Fu, Yufei Jia, Cewu Lu, Yue Gao

PDF

Open Access

TL;DR

This paper introduces HiWET, a hierarchical reinforcement learning framework for humanoid robots that improves long-horizon manipulation by tracking end-effectors in the world frame, ensuring stability and accuracy.

Contribution

It proposes a novel hierarchical approach with a Kinematic Manifold Prior to enhance world-frame end-effector tracking and stability in humanoid loco-manipulation tasks.

Findings

01

Achieves precise end-effector tracking in simulation

02

Demonstrates stable zero-shot sim-to-real transfer

03

Reduces exploration complexity with KMP

Abstract

Humanoid loco-manipulation requires executing precise manipulation tasks while maintaining dynamic stability amid base motion and impacts. Existing approaches typically formulate commands in body-centric frames, fail to inherently correct cumulative world-frame drift induced by legged locomotion. We reformulate the problem as world-frame end-effector tracking and propose HiWET, a hierarchical reinforcement learning framework that decouples global reasoning from dynamic execution. The high-level policy generates subgoals that jointly optimize end-effector accuracy and base positioning in the world frame, while the low-level policy executes these commands under stability constraints. We introduce a Kinematic Manifold Prior (KMP) that embeds the manipulation manifold into the action space via residual learning, reducing exploration dimensionality and mitigating kinematically invalid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Human Motion and Animation · Reinforcement Learning in Robotics