Interaction Mix and Match: Synthesizing Close Interaction using Conditional Hierarchical GAN with Multi-Hot Class Embedding
Aman Goel, Qianhui Men, Edmond S. L. Ho

TL;DR
This paper introduces a novel Conditional Hierarchical GAN with Multi-Hot Class Embedding to synthesize diverse, realistic multi-character interactions, including unseen motions, outperforming existing methods on multiple datasets.
Contribution
It presents a new generative model that enables mixing and matching different close interactions to produce realistic reactive motions not present in the training data.
Findings
Outperforms state-of-the-art methods on multiple datasets
Generates realistic reactive motions including unseen types
Provides an augmented dataset for future research
Abstract
Synthesizing multi-character interactions is a challenging task due to the complex and varied interactions between the characters. In particular, precise spatiotemporal alignment between characters is required in generating close interactions such as dancing and fighting. Existing work in generating multi-character interactions focuses on generating a single type of reactive motion for a given sequence which results in a lack of variety of the resultant motions. In this paper, we propose a novel way to create realistic human reactive motions which are not presented in the given dataset by mixing and matching different types of close interactions. We propose a Conditional Hierarchical Generative Adversarial Network with Multi-Hot Class Embedding to generate the Mix and Match reactive motions of the follower from a given motion sequence of the leader. Experiments are conducted on both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Video Analysis and Summarization
