QPoser: Quantized Explicit Pose Prior Modeling for Controllable Pose   Generation

Yumeng Li; Yaoxiang Ding; Zhong Ren; Kun Zhou

arXiv:2312.01104·cs.CV·December 5, 2023·1 cites

QPoser: Quantized Explicit Pose Prior Modeling for Controllable Pose Generation

Yumeng Li, Yaoxiang Ding, Zhong Ren, Kun Zhou

PDF

Open Access

TL;DR

QPoser introduces a controllable explicit pose prior model that guarantees correctness and expressiveness, enabling detailed and conditional human pose generation with improved accuracy over existing methods.

Contribution

The paper proposes QPoser, a novel pose prior model combining multi-head vector quantized autoencoder and global-local feature integration for enhanced controllability, correctness, and expressiveness.

Findings

01

Outperforms state-of-the-art in pose representation accuracy

02

Enables detailed conditional pose generation

03

Maintains physical plausibility of generated poses

Abstract

Explicit pose prior models compress human poses into latent representations for using in pose-related downstream tasks. A desirable explicit pose prior model should satisfy three desirable abilities: 1) correctness, i.e. ensuring to generate physically possible poses; 2) expressiveness, i.e. ensuring to preserve details in generation; 3) controllability, meaning that generation from reference poses and explicit instructions should be convenient. Existing explicit pose prior models fail to achieve all of three properties, in special controllability. To break this situation, we propose QPoser, a highly controllable explicit pose prior model which guarantees correctness and expressiveness. In QPoser, a multi-head vector quantized autoencoder (MS-VQVAE) is proposed for obtaining expressive and distributed pose representations. Furthermore, a global-local feature integration mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Human Motion and Animation · Multimodal Machine Learning Applications