PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation

Hongyu Yan; Kunming Luo; Weiyu Li; Kaiyi Zhang; Yixun Liang; Jingwei Huang; Chunchao Guo; Ping Tan

arXiv:2506.21076·cs.CV·March 24, 2026

TL;DR

PoseMaster introduces a unified 3D framework for stylized pose generation that directly uses 3D skeletons, improving accuracy, diversity, and efficiency over traditional cascade methods, and enabling automatic rigging.

Contribution

The paper presents a novel integrated framework for 3D pose stylization that directly leverages 3D skeletons and a large-scale dataset, surpassing existing methods in quality and applicability.

Findings

01

Outperforms state-of-the-art methods in both qualitative and quantitative evaluations.

02

Enables direct creation of animatable assets with automated skinning.

03

Improves pose stylization accuracy and diversity.

Abstract

Pose stylization, which aims to synthesize stylized content aligned with target poses, is a fundamental task across the 2D, 3D, and video domains. In the 3D realm, prevailing approaches typically rely on a cascade pipeline: first manipulating the image pose via 2D foundation models and subsequently lifting it into 3D representations. However, this paradigm limits the precision and diversity of 3D pose stylization. To this end, we propose a novel paradigm for 3D pose stylization that unifies pose stylization and 3D generation within a cohesive framework. This integration minimizes the risk of cumulative errors and enhances the model's efficiency and effectiveness. In addition, diverging from previous works that typically utilize 2D skeleton images as guidance, we directly utilize the 3D skeleton because it can provide a more accurate representation of 3D spatial and topological…
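The abstract's contrast between 2D skeleton images and 3D skeletons can be illustrated with a small sketch. This is not the paper's representation or code: the `Skeleton3D` class, its fields, and the `project_xy` helper are hypothetical names introduced here, and the three-joint chain is a toy example. It only shows that a 3D skeleton stores depth and parent-child topology explicitly, while a 2D projection can map two different poses to the same image.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class Skeleton3D:
    """Hypothetical container: one 3D position and one parent index per joint."""
    joints: List[Vec3]
    parents: List[int]  # parents[i] is the parent joint of joint i; -1 marks the root

    def bones(self) -> List[Tuple[int, int]]:
        """Return (parent, child) index pairs, i.e. the skeleton's topology."""
        return [(p, c) for c, p in enumerate(self.parents) if p >= 0]

    def bone_lengths(self) -> List[float]:
        """Return the Euclidean length of each bone, in the order of bones()."""
        return [math.dist(self.joints[p], self.joints[c]) for p, c in self.bones()]


def project_xy(skel: Skeleton3D) -> List[Tuple[float, float]]:
    """Orthographic projection that drops z (an assumed, simplified camera model)."""
    return [(x, y) for x, y, _ in skel.joints]


# Toy chain: root -> shoulder -> hand. Only the hand's depth differs.
parents = [-1, 0, 1]
hand_forward = Skeleton3D([(0.0, 0.0, 0.0), (0.0, 0.5, 0.0), (0.2, 0.5, 0.3)], parents)
hand_backward = Skeleton3D([(0.0, 0.0, 0.0), (0.0, 0.5, 0.0), (0.2, 0.5, -0.3)], parents)

# The two poses are different in 3D but identical after projection to 2D.
same_in_2d = project_xy(hand_forward) == project_xy(hand_backward)
same_in_3d = hand_forward.joints == hand_backward.joints
print(same_in_2d, same_in_3d)
```

Under these assumptions, a forward reach and a backward reach collapse to the same 2D skeleton, which is one way to read the abstract's claim that 3D skeletons carry more accurate spatial information than 2D skeleton images.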

Taxonomy

Topics: Human Motion and Animation · Human Pose and Action Recognition · 3D Surveying and Cultural Heritage