MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo,, Lei Yang, Ziwei Liu

TL;DR
MotionDiffuse introduces a diffusion model-based framework for text-driven human motion generation, enabling diverse, realistic, and controllable motion synthesis from natural language inputs, surpassing existing methods in quality and flexibility.
Contribution
It is the first diffusion model approach for text-driven human motion generation, offering probabilistic mapping, realistic synthesis, and multi-level manipulation capabilities.
Findings
Outperforms state-of-the-art methods in text-driven motion generation.
Generates diverse and vivid motion sequences.
Provides fine-grained control over body parts and motion length.
Abstract
Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions conditioned on natural languages. However, it remains challenging to achieve diverse and fine-grained motion generation with various text inputs. To address this problem, we propose MotionDiffuse, the first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods. 1) Probabilistic Mapping. Instead of a deterministic language-motion mapping, MotionDiffuse generates motions through a series of denoising steps in which variations are injected. 2) Realistic Synthesis. MotionDiffuse excels at modeling complicated data distribution and generating vivid motion sequences. 3) Multi-Level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
MethodsDiffusion
