MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Mingyuan Zhang; Zhongang Cai; Liang Pan; Fangzhou Hong; Xinying Guo,; Lei Yang; Ziwei Liu

arXiv:2208.15001·cs.CV·September 1, 2022·110 cites

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo,, Lei Yang, Ziwei Liu

PDF

Open Access 2 Repos

TL;DR

MotionDiffuse introduces a diffusion model-based framework for text-driven human motion generation, enabling diverse, realistic, and controllable motion synthesis from natural language inputs, surpassing existing methods in quality and flexibility.

Contribution

It is the first diffusion model approach for text-driven human motion generation, offering probabilistic mapping, realistic synthesis, and multi-level manipulation capabilities.

Findings

01

Outperforms state-of-the-art methods in text-driven motion generation.

02

Generates diverse and vivid motion sequences.

03

Provides fine-grained control over body parts and motion length.

Abstract

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions conditioned on natural languages. However, it remains challenging to achieve diverse and fine-grained motion generation with various text inputs. To address this problem, we propose MotionDiffuse, the first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods. 1) Probabilistic Mapping. Instead of a deterministic language-motion mapping, MotionDiffuse generates motions through a series of denoising steps in which variations are injected. 2) Realistic Synthesis. MotionDiffuse excels at modeling complicated data distribution and generating vivid motion sequences. 3) Multi-Level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · 3D Shape Modeling and Analysis

MethodsDiffusion