Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin; Hao Li; Zesen Cheng; Kehan Li; Runyi Yu; Chang Liu,; Xiangyang Ji; Li Yuan; Jie Chen

arXiv:2407.10528·cs.CV·July 16, 2024

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu,, Xiangyang Ji, Li Yuan, Jie Chen

PDF

Open Access

TL;DR

This paper introduces a local action-guided diffusion model for text-to-motion generation, enabling more diverse, realistic, and controllable global motions by leveraging local actions as fine-grained control signals.

Contribution

It proposes a novel local action-guided diffusion framework that improves motion diversity and control in text-to-motion synthesis by integrating local actions and guiding weights during the diffusion process.

Findings

01

Effective on HumanML3D and KIT datasets

02

Enhances motion diversity and user control

03

Demonstrates seamless local-global motion integration

Abstract

Text-to-motion generation requires not only grounding local actions in language but also seamlessly blending these individual actions to synthesize diverse and realistic global motions. However, existing motion generation methods primarily focus on the direct synthesis of global motions while neglecting the importance of generating and controlling local actions. In this paper, we propose the local action-guided motion diffusion model, which facilitates global motion generation by utilizing local actions as fine-grained control signals. Specifically, we provide an automated method for reference local action sampling and leverage graph attention networks to assess the guiding weight of each local action in the overall motion synthesis. During the diffusion process for synthesizing global motion, we calculate the local-action gradient to provide conditional guidance. This local-to-global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Simulation and Modeling Applications · Video Analysis and Summarization

MethodsSoftmax · Attention Is All You Need · Focus · Diffusion