HOI-Dyn: Learning Interaction Dynamics for Human-Object Motion Diffusion
Lin Wu, Zhixiang Chen, Jianglin Lan

TL;DR
HOI-Dyn introduces a transformer-based framework for generating realistic human-object interactions by modeling interaction dynamics as a driver-responder system, improving plausibility and consistency in 3D HOI synthesis.
Contribution
The paper presents a novel interaction dynamics model and residual loss that explicitly capture human-object responses, enhancing HOI generation quality while maintaining inference efficiency.
Findings
Improved realism in generated 3D human-object interactions.
Establishment of a new metric for evaluating HOI quality.
Demonstrated effectiveness through extensive experiments.
Abstract
Generating realistic 3D human-object interactions (HOIs) remains a challenging task due to the difficulty of modeling detailed interaction dynamics. Existing methods treat human and object motions independently, resulting in physically implausible and causally inconsistent behaviors. In this work, we present HOI-Dyn, a novel framework that formulates HOI generation as a driver-responder system, where human actions drive object responses. At the core of our method is a lightweight transformer-based interaction dynamics model that explicitly predicts how objects should react to human motion. To further enforce consistency, we introduce a residual-based dynamics loss that mitigates the impact of dynamics prediction errors and prevents misleading optimization signals. The dynamics model is used only during training, preserving inference efficiency. Through extensive qualitative and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Motion and Animation · Human Pose and Action Recognition · Multimodal Machine Learning Applications
