PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation
Hongyu Yan, Kunming Luo, Weiyu Li, Kaiyi Zhang, Yixun Liang, Jingwei Huang, Chunchao Guo, Ping Tan

TL;DR
PoseMaster introduces a unified 3D framework for stylized pose generation that directly uses 3D skeletons, improving accuracy, diversity, and efficiency over traditional cascade methods, and enabling automatic rigging.
Contribution
The paper presents a novel integrated framework for 3D pose stylization that directly leverages 3D skeletons and a large-scale dataset, surpassing existing methods in quality and applicability.
Findings
Outperforms state-of-the-art in qualitative and quantitative metrics.
Enables direct creation of animatable assets with automated skinning.
Improves pose stylization accuracy and diversity.
Abstract
Pose stylization, which aims to synthesize stylized content aligning with target poses, serves as a fundamental task across 2D, 3D, and video domains. In the 3D realm, prevailing approaches typically rely on a cascade pipeline: first manipulating the image pose via 2D foundation models and subsequently lifting it into 3D representations. However, this paradigm limits the precision and diversity of the 3d pose stylization. To this end, we propose a novel paradigm for 3D pose stylization that unifies pose stylization and 3D generation within a cohesive framework. This integration minimizes the risk of cumulative errors and enhances the model's efficiency and effectiveness. In addition, diverging from previous works that typically utilize 2D skeleton images as guidance, we directly utilize the 3D skeleton because it can provide a more accurate representation of 3D spatial and topological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Surveying and Cultural Heritage
