Object Motion Guided Human Motion Synthesis
Jiaman Li, Jiajun Wu, C. Karen Liu

TL;DR
This paper introduces OMOMO, a diffusion-based framework for synthesizing realistic full-body human manipulation motions guided solely by object motion, explicitly enforcing contact constraints for physical plausibility.
Contribution
The work proposes a novel two-stage diffusion model that predicts hand positions from object motion and then synthesizes full-body poses, improving contact accuracy in human-object interaction synthesis.
Findings
Effective generation of human manipulation motions from object motion.
Explicit contact constraint enforcement improves motion realism.
Model generalizes to unseen objects and scenarios.
Abstract
Modeling human behaviors in contextual environments has a wide range of applications in character animation, embodied AI, VR/AR, and robotics. In real-world scenarios, humans frequently interact with the environment and manipulate various objects to complete daily tasks. In this work, we study the problem of full-body human motion synthesis for the manipulation of large-sized objects. We propose Object MOtion guided human MOtion synthesis (OMOMO), a conditional diffusion framework that can generate full-body manipulation behaviors from only the object motion. Since naively applying diffusion models fails to precisely enforce contact constraints between the hands and the object, OMOMO learns two separate denoising processes to first predict hand positions from object motion and subsequently synthesize full-body poses based on the predicted hand positions. By employing the hand positions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · 3D Shape Modeling and Analysis
MethodsDiffusion
