Contact-aware Human Motion Generation from Textual Descriptions
Sihan Ma, Qiong Cao, Jing Zhang, Dacheng Tao

TL;DR
This paper introduces a novel dataset and method for generating realistic 3D human motions from text descriptions that include contact interactions, improving naturalness and plausibility.
Contribution
The paper presents RICH-CAT, a contact-aware motion-text dataset, and CATMO, a new approach that explicitly models human-object contacts for text-driven motion synthesis.
Findings
Outperforms existing text-to-motion methods in stability and contact accuracy
Generates more natural and physically plausible human motions
Enables precise control over contact interactions in generated motions
Abstract
This paper addresses the problem of generating 3D interactive human motion from text. Given a textual description depicting the actions of different body parts in contact with static objects, we synthesize sequences of 3D body poses that are visually natural and physically plausible. Yet, this task poses a significant challenge due to the inadequate consideration of interactions by physical contacts in both motion and textual descriptions, leading to unnatural and implausible sequences. To tackle this challenge, we create a novel dataset named RICH-CAT, representing "Contact-Aware Texts" constructed from the RICH dataset. RICH-CAT comprises high-quality motion, accurate human-object contact labels, and detailed textual descriptions, encompassing over 8,500 motion-text pairs across 26 indoor/outdoor actions. Leveraging RICH-CAT, we propose a novel approach named CATMO for text-driven…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Hand Gesture Recognition Systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Cosine Annealing · Multi-Head Attention · Linear Warmup With Cosine Annealing · Softmax · Discriminative Fine-Tuning
