HyperMotionX: The Dataset and Benchmark with DiT-Based Pose-Guided Human Image Animation of Complex Motions

Shuolin Xu; Siming Zheng; Ziyi Wang; HC Yu; Jinwei Chen; Huaqi Zhang; Daquan Zhou; Tong-Yee Lee; Bo Li; Peng-Tao Jiang

arXiv:2505.22977·cs.CV·March 19, 2026

HyperMotionX: The Dataset and Benchmark with DiT-Based Pose-Guided Human Image Animation of Complex Motions

Shuolin Xu, Siming Zheng, Ziyi Wang, HC Yu, Jinwei Chen, Huaqi Zhang, Daquan Zhou, Tong-Yee Lee, Bo Li, Peng-Tao Jiang

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces HyperMotionX, a new dataset and benchmark for evaluating pose-guided human image animation of complex motions, along with a DiT-based model that enhances animation quality in dynamic scenarios.

Contribution

It presents a novel DiT-based baseline with a spatial low-frequency enhancement module and introduces the HyperMotionX dataset and benchmark for complex human motion animation evaluation.

Findings

01

Improved structural stability in dynamic human motion sequences

02

Enhanced appearance consistency in animations

03

Effective evaluation of complex motion animation models

Abstract

Recent advances in diffusion models have significantly improved conditional video generation, particularly in the pose-guided human image animation task. Although existing methods are capable of generating high-fidelity and time-consistent animation sequences in regular motions and static scenes. However there are still obvious limitations when facing complex human body motions that contain highly dynamic, non-standard motions, and the lack of a high-quality benchmark for evaluation of complex human motion animations. To address this challenge, we propose a concise yet powerful DiT-based human animation generation baseline and design spatial low-frequency enhanced RoPE, a novel module that selectively enhances low-frequency spatial feature modeling by introducing learnable frequency scaling. Furthermore, we introduce the Open-HyperMotionX Dataset and HyperMotionX Bench, which provide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vivoCameraResearch/Hyper-Motion
pytorchOfficial

Models

🤗
shuolin/HyperMotion
model· 44 dl· ♡ 9
44 dl♡ 9

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition

MethodsDiffusion