OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control

Yunshen Wang; Shaohang Zhu; Peiyuan Zhi; Yuhan Li; Jiaxin Li; Yong-Lu Li; Yuchen Xiao; Xingxing Wang; Baoxiong Jia; Siyuan Huang

arXiv:2602.23843·cs.RO·March 2, 2026

OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control

Yunshen Wang, Shaohang Zhu, Peiyuan Zhi, Yuhan Li, Jiaxin Li, Yong-Lu Li, Yuchen Xiao, Xingxing Wang, Baoxiong Jia, Siyuan Huang

PDF

Open Access

TL;DR

OmniXtreme introduces a scalable framework that decouples general motor skill learning from physical refinement, enabling high-fidelity, diverse high-dynamic humanoid motion tracking and execution on real robots.

Contribution

It presents a novel decoupled approach combining flow-matching policies with actuation-aware refinement to overcome scalability and physical constraints in humanoid control.

Findings

01

Maintains high-fidelity tracking across diverse datasets

02

Successfully executes multiple extreme motions on real robots

03

Breaks the fidelity-scalability trade-off in high-dynamic control

Abstract

High-fidelity motion tracking serves as the ultimate litmus test for generalizable, human-level motor skills. However, current policies often hit a "generality barrier": as motion libraries scale in diversity, tracking fidelity inevitably collapses - especially for real-world deployment of high-dynamic motions. We identify this failure as the result of two compounding factors: the learning bottleneck in scaling multi-motion optimization and the physical executability constraints that arise in real-world actuation. To overcome these challenges, we introduce OmniXtreme, a scalable framework that decouples general motor skill learning from sim-to-real physical skill refinement. Our approach uses a flow-matching policy with high-capacity architectures to scale representation capacity without interference-intensive multi-motion RL optimization, followed by an actuation-aware refinement phase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition