Human Motion Diffusion Model
Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, and Amit H. Bermano

TL;DR
The paper introduces Motion Diffusion Model (MDM), a transformer-based diffusion approach for human motion generation that achieves state-of-the-art results with lightweight training resources and flexible conditioning capabilities.
Contribution
It presents a novel diffusion model for human motion that predicts samples directly, enabling effective geometric loss application and versatile conditioning for various motion tasks.
Findings
Achieves state-of-the-art results on text-to-motion benchmarks.
Uses lightweight training resources compared to existing models.
Supports multiple conditioning modes and motion generation tasks.
Abstract
Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it. Therefore, current generative solutions are either low-quality or limited in expressiveness. Diffusion models, which have already shown remarkable generative capabilities in other domains, are promising candidates for human motion due to their many-to-many nature, but they tend to be resource hungry and hard to control. In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain. MDM is transformer-based, combining insights from motion generation literature. A notable design-choice is the prediction of the sample, rather than the noise, in each diffusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
