InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint
Zhenzhi Wang, Jingbo Wang, Yixuan Li, Dahua Lin, Bo Dai

TL;DR
InterControl is a zero-shot motion synthesis framework that generates multi-human interactions with controllable joint-to-joint distances. It extends to any number of characters and is guided by language models and inverse kinematics.
Contribution
We propose a flexible, zero-shot motion generation method for multi-human interactions. It requires no training on multi-character datasets and relies on joint-pair control and language-model guidance.
Findings
Successfully generates multi-human interactions with arbitrary group sizes.
Maintains desired joint distances in synthesized motions.
Works effectively with physics-based character simulators.
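As an illustration of what "maintaining a desired joint distance" could mean in practice, here is a minimal sketch that measures how far a pair of joints on two characters deviates from a target distance. This is not the authors' implementation: the function name, the `(frames, joints, 3)` array layout, the joint indices, and the choice of mean absolute deviation are all assumptions made for this example.

```python
import numpy as np

def joint_pair_distance_error(motion_a, motion_b, joint_a, joint_b, target_dist):
    """Mean absolute deviation between actual and desired joint distance.

    motion_a, motion_b: arrays of shape (frames, joints, 3) holding global
    joint positions for two characters (layout assumed for this sketch).
    joint_a, joint_b: indices of the joints forming the controlled pair.
    target_dist: desired distance; 0.0 corresponds to contact.
    """
    pos_a = motion_a[:, joint_a, :]                  # (frames, 3)
    pos_b = motion_b[:, joint_b, :]                  # (frames, 3)
    dist = np.linalg.norm(pos_a - pos_b, axis=-1)    # per-frame distance
    return float(np.mean(np.abs(dist - target_dist)))

# Toy data: both characters at the origin, except one joint of character B
# that is offset by 0.5 units along x. Joint counts and indices are hypothetical.
frames, joints = 4, 22
a = np.zeros((frames, joints, 3))
b = np.zeros((frames, joints, 3))
b[:, 20, 0] = 0.5

err_contact = joint_pair_distance_error(a, b, 21, 20, target_dist=0.0)
err_half = joint_pair_distance_error(a, b, 21, 20, target_dist=0.5)
```

In this toy setup the pair is 0.5 units apart in every frame, so the error is 0.5 against a contact target and 0 against a target of 0.5. A multi-character scene would simply evaluate several such pairs, one per constrained joint pair.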
Abstract
Text-conditioned motion synthesis has made remarkable progress with the emergence of diffusion models. However, the majority of these motion diffusion models are primarily designed for a single character and overlook multi-human interactions. In our approach, we strive to explore this problem by synthesizing human motion with interactions for a group of characters of any size in a zero-shot manner. The key aspect of our approach is the adaptation of human-wise interactions as pairs of human joints that can be either in contact or separated by a desired distance. In contrast to existing methods that necessitate training motion generation models on multi-human motion datasets with a fixed number of characters, our approach inherently possesses the flexibility to model human interactions involving an arbitrary number of individuals, thereby transcending the limitations imposed by the…
Taxonomy
Topics: Human Motion and Animation · Human Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis
Methods: Diffusion · ALIGN
