Unifying Human Motion Synthesis and Style Transfer with Denoising Diffusion Probabilistic Models
Ziyi Chang, Edmund J. C. Findlay, Haozheng Zhang, Hubert P. H. Shum

TL;DR
This paper introduces a diffusion probabilistic model that unifies human motion synthesis and style transfer, enabling integrated, high-quality motion generation with a shared latent space.
Contribution
It proposes a novel diffusion model framework that jointly learns motion content and style in a single latent space, unifying synthesis and style transfer tasks.
Findings
Outperforms existing methods in quality and diversity of generated motions
Effectively captures both inter-class content and intra-class style behaviors
Demonstrates the benefits of multi-task diffusion architecture with quantitative and qualitative results
Abstract
Generating realistic motions for digital humans is a core but challenging part of computer animations and games, as human motions are both diverse in content and rich in styles. While the latest deep learning approaches have made significant advancements in this domain, they mostly consider motion synthesis and style manipulation as two separate problems. This is mainly due to the challenge of learning both motion contents that account for the inter-class behaviour and styles that account for the intra-class behaviour effectively in a common representation. To tackle this challenge, we propose a denoising diffusion probabilistic model solution for styled motion synthesis. As diffusion models have a high capacity brought by the injection of stochasticity, we can represent both inter-class motion content and intra-class style behaviour in the same latent. This results in an integrated,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
