THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
Qianyang Wu, Ye Shi, Xiaoshui Huang, Jingyi Yu, Lan Xu, Jingya Wang

TL;DR
This paper introduces THOR, a diffusion-based model with relation intervention for generating realistic human-object interactions from text, addressing motion variation, object diversity, and semantic vagueness.
Contribution
We propose a novel Text-guided Human-Object Interaction diffusion model with relation intervention and a new dataset, Text-BEHAVE, for improved Text2HOI generation.
Findings
THOR outperforms existing methods in generating realistic interactions.
Relation intervention improves spatial-temporal consistency.
The dataset enhances training and evaluation of Text2HOI models.
Abstract
This paper addresses new methodologies to deal with the challenging task of generating dynamic Human-Object Interactions from textual descriptions (Text2HOI). While most existing works assume interactions with limited body parts or static objects, our task involves addressing the variation in human motion, the diversity of object shapes, and the semantic vagueness of object motion simultaneously. To tackle this, we propose a novel Text-guided Human-Object Interaction diffusion model with Relation Intervention (THOR). THOR is a cohesive diffusion model equipped with a relation intervention mechanism. In each diffusion step, we initiate text-guided human and object motion and then leverage human-object relations to intervene in object motion. This intervention enhances the spatial-temporal relations between humans and objects, with human-centric interaction representation providing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
MethodsDiffusion
