TL;DR
MoDi is a novel generative model capable of unconditionally synthesizing diverse, high-quality motions from unstructured data, with a well-structured latent space enabling semantic editing and inversion.
Contribution
We introduce MoDi, an unsupervised motion synthesis model that learns from highly diverse, unlabeled data, producing state-of-the-art results and enabling semantic motion editing and inversion.
Findings
Achieves state-of-the-art motion synthesis quality.
Latent space is semantically meaningful and well-structured.
Enables effective motion editing and inversion tasks.
Abstract
The emergence of neural networks has revolutionized the field of motion synthesis. Yet, learning to unconditionally synthesize motions from a given distribution remains challenging, especially when the motions are highly diverse. In this work, we present MoDi -- a generative model trained in an unsupervised setting from an extremely diverse, unstructured and unlabeled dataset. During inference, MoDi can synthesize high-quality, diverse motions. Despite the lack of any structure in the dataset, our model yields a well-behaved and highly structured latent space, which can be semantically clustered, constituting a strong motion prior that facilitates various applications including semantic editing and crowd simulation. In addition, we present an encoder that inverts real motions into MoDi's natural motion manifold, issuing solutions to various ill-posed challenges such as completion from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHuMan(Expedia)||How do I get a human at Expedia? · Dense Connections · Feedforward Network · Convolution · R1 Regularization · Adaptive Instance Normalization
