MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics

Xinchen Yan; Akash Rastogi; Ruben Villegas; Kalyan Sunkavalli; Eli Shechtman; Sunil Hadap; Ersin Yumer; Honglak Lee

arXiv:1808.04545·cs.LG·August 15, 2018

TL;DR

MT-VAE is a generative model that produces diverse, plausible future human motion sequences from the same input by learning transitions between short-term motion modes, with applications such as analogy-based motion transfer and video synthesis.

Contribution

Introduces MT-VAE, a model that jointly learns motion mode embeddings and transition transformations for multimodal human motion generation.

Findings

01. Generates multiple diverse, plausible future human motions from the same input.

02. Applies to both facial and full-body motion.

03. Enables applications such as analogy-based motion transfer and video synthesis.

Abstract

Long-term human motion can be represented as a series of motion modes (motion sequences that capture short-term temporal dynamics) with transitions between them. We leverage this structure and present a novel Motion Transformation Variational Auto-Encoder (MT-VAE) for learning motion sequence generation. Our model jointly learns a feature embedding for motion modes (from which the motion sequence can be reconstructed) and a feature transformation that represents the transition from one motion mode to the next. Our model can generate multiple diverse and plausible future motion sequences from the same input. We apply our approach to both facial and full-body motion, and demonstrate applications such as analogy-based motion transfer and video synthesis.
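
The pipeline described in the abstract (embed a motion mode, model the transition to the next mode as a transformation in feature space, sample that transformation to get several futures) can be illustrated with a toy sketch. This is not the authors' architecture, which the abstract does not specify. The linear maps below are random and untrained, the additive form of the transformation is an assumption made for illustration, and every name and size (`embed`, `apply_transition`, `T`, `D`, `E`, `Z`) is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: frames per motion mode, pose dims per frame,
# embedding size, latent size.
T, D, E, Z = 8, 6, 16, 4

# Random, untrained linear maps standing in for learned networks (assumption).
W_enc = rng.normal(scale=0.1, size=(T * D, E))     # motion mode -> embedding
W_dec = rng.normal(scale=0.1, size=(E, T * D))     # embedding -> motion mode
W_mu = rng.normal(scale=0.1, size=(E, Z))          # embedding change -> latent mean
W_logvar = rng.normal(scale=0.1, size=(E, Z))      # embedding change -> latent log-variance
W_trans = rng.normal(scale=0.1, size=(Z + E, E))   # (latent, embedding) -> transformation


def embed(mode):
    """Map a (T, D) motion mode to an E-dimensional feature embedding."""
    return np.tanh(mode.reshape(-1) @ W_enc)


def reconstruct(e):
    """Map an embedding back to a (T, D) motion mode."""
    return (e @ W_dec).reshape(T, D)


def encode_transition(e_past, e_future):
    """Infer a Gaussian over the latent z from the change between two embeddings."""
    delta = e_future - e_past
    return delta @ W_mu, delta @ W_logvar


def sample_latent(mu, logvar):
    """Reparameterization trick: z = mu + sigma * eps."""
    return mu + np.exp(0.5 * logvar) * rng.standard_normal(mu.shape)


def apply_transition(e_past, z):
    """Predict the next embedding by adding a z-dependent transformation (assumed additive)."""
    return e_past + np.tanh(np.concatenate([z, e_past]) @ W_trans)


def kl_to_standard_normal(mu, logvar):
    """KL(q(z | past, future) || N(0, I)), the usual VAE regularizer."""
    return 0.5 * np.sum(np.exp(logvar) + mu ** 2 - 1.0 - logvar)


def generate_futures(past_mode, n_samples):
    """Sample several futures for one past mode by drawing z from the prior."""
    e_past = embed(past_mode)
    return [
        reconstruct(apply_transition(e_past, rng.standard_normal(Z)))
        for _ in range(n_samples)
    ]


# Placeholder data; real inputs would be pose or landmark sequences.
past = rng.normal(size=(T, D))
future = rng.normal(size=(T, D))

# Training-time path: infer z from (past, future), then predict the future mode.
mu, logvar = encode_transition(embed(past), embed(future))
z = sample_latent(mu, logvar)
pred = reconstruct(apply_transition(embed(past), z))
loss = np.mean((pred - future) ** 2) + kl_to_standard_normal(mu, logvar)

# Test-time path: the same past yields different futures for different z.
futures = generate_futures(past, n_samples=3)
```

Because the latent variable describes the transition rather than the future sequence itself, drawing different values of `z` for the same `past` gives different predicted modes, which is the sense in which generation is multimodal in this sketch.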

Code & Models

Repositories

xcyan/eccv18_mtvae (TensorFlow)

Taxonomy

Topics: Human Motion and Animation · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis