Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset
Jing Lin, Ailing Zeng, Shunlin Lu, Yuanhao Cai, Ruimao Zhang, Haoqian, Wang, Lei Zhang

TL;DR
Motion-X is a large-scale 3D whole-body human motion dataset with detailed annotations, enabling advances in expressive motion generation and 3D human mesh recovery.
Contribution
We developed a scalable, high-precision annotation pipeline and created Motion-X, a comprehensive dataset with 15.6 million 3D pose annotations and semantic labels.
Findings
Annotation pipeline is highly accurate and cost-effective.
Motion-X enhances expressive and diverse motion generation.
Improves 3D human mesh recovery methods.
Abstract
In this paper, we present Motion-X, a large-scale 3D expressive whole-body motion dataset. Existing motion datasets predominantly contain body-only poses, lacking facial expressions, hand gestures, and fine-grained pose descriptions. Moreover, they are primarily collected from limited laboratory scenes with textual descriptions manually labeled, which greatly limits their scalability. To overcome these limitations, we develop a whole-body motion and text annotation pipeline, which can automatically annotate motion from either single- or multi-view videos and provide comprehensive semantic labels for each video and fine-grained whole-body pose descriptions for each frame. This pipeline is of high precision, cost-effective, and scalable for further research. Based on it, we construct Motion-X, which comprises 15.6M precise 3D whole-body pose annotations (i.e., SMPL-X) covering 81.1K…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Video Analysis and Summarization
