DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Junzhe Lu; Jing Lin; Hongkun Dou; Ailing Zeng; Yue Deng; Xian Liu; Zhongang Cai; Lei Yang; Yulun Zhang; Haoqian Wang; Ziwei Liu

arXiv:2508.00599·cs.CV·August 5, 2025

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Junzhe Lu, Jing Lin, Hongkun Dou, Ailing Zeng, Yue Deng, Xian Liu, Zhongang Cai, Lei Yang, Yulun Zhang, Haoqian Wang, Ziwei Liu

PDF

Open Access 1 Models

TL;DR

DPoser-X is a diffusion-based model that provides a robust and versatile prior for 3D whole-body human poses, effectively handling complex articulated poses and limited data.

Contribution

We introduce DPoser-X, a novel diffusion model for expressive whole-body pose prior modeling, with a new training strategy and improved performance across multiple benchmarks.

Findings

01

Outperforms state-of-the-art methods on various pose benchmarks.

02

Effectively models interdependencies between body parts.

03

Demonstrates robustness and versatility in diverse pose tasks.

Abstract

We present DPoser-X, a diffusion-based prior model for 3D whole-body human poses. Building a versatile and robust full-body human pose prior remains challenging due to the inherent complexity of articulated human poses and the scarcity of high-quality whole-body pose datasets. To address these limitations, we introduce a Diffusion model as body Pose prior (DPoser) and extend it to DPoser-X for expressive whole-body human pose modeling. Our approach unifies various pose-centric tasks as inverse problems, solving them through variational diffusion sampling. To enhance performance on downstream applications, we introduce a novel truncated timestep scheduling method specifically designed for pose data characteristics. We also propose a masked training mechanism that effectively combines whole-body and part-specific datasets, enabling our model to capture interdependencies between body parts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Moon-bow/DPoser-X
model· ♡ 4
♡ 4

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Robot Manipulation and Learning