MCM: Multi-condition Motion Synthesis Framework

Zeyu Ling; Bo Han; Yongkang Wongkan; Han Lin; Mohan Kankanhalli,; Weidong Geng

arXiv:2404.12886·cs.CV·November 19, 2024

MCM: Multi-condition Motion Synthesis Framework

Zeyu Ling, Bo Han, Yongkang Wongkan, Han Lin, Mohan Kankanhalli,, Weidong Geng

PDF

Open Access 1 Repo

TL;DR

This paper introduces MCM, a multi-condition human motion synthesis framework that extends diffusion models to handle both text and audio inputs, enabling high-quality, semantically meaningful motion generation for diverse modalities.

Contribution

The paper presents a novel dual-branch diffusion framework, MCM, incorporating a Transformer-based MWNet to effectively synthesize motion from multiple conditions, including audio and text.

Findings

01

Achieves competitive results in single-condition HMS tasks.

02

Successfully extends diffusion models to multi-condition scenarios.

03

Maintains motion quality and semantic relevance across modalities.

Abstract

Conditional human motion synthesis (HMS) aims to generate human motion sequences that conform to specific conditions. Text and audio represent the two predominant modalities employed as HMS control conditions. While existing research has primarily focused on single conditions, the multi-condition human motion synthesis remains underexplored. In this study, we propose a multi-condition HMS framework, termed MCM, based on a dual-branch structure composed of a main branch and a control branch. This framework effectively extends the applicability of the diffusion model, which is initially predicated solely on textual conditions, to auditory conditions. This extension encompasses both music-to-dance and co-speech HMS while preserving the intrinsic quality of motion and the capabilities for semantic association inherent in the original model. Furthermore, we propose the implementation of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fluide1022/MCM
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Advanced Vision and Imaging · Robotic Mechanisms and Dynamics

MethodsDiffusion