Motion Generation from Fine-grained Textual Descriptions
Kunhang Li, Yansong Feng

TL;DR
This paper introduces a new dataset and model for generating human motion sequences from detailed textual descriptions, significantly improving the quality and diversity of generated motions compared to previous methods.
Contribution
The paper presents a large-scale fine-grained language-motion dataset, FineHumanML3D, and a novel model, FineMotionDiffuse, that leverages detailed descriptions for improved motion generation.
Findings
FineMotionDiffuse improves FID by 0.38 over baselines.
The model outperforms MotionDiffuse in qualitative assessments.
The dataset enables learning from fine-grained textual instructions.
Abstract
The task of text2motion is to generate human motion sequences from given textual descriptions, where the model explores diverse mappings from natural language instructions to human body movements. While most existing works are confined to coarse-grained motion descriptions, e.g., "A man squats.", fine-grained descriptions specifying movements of relevant body parts are barely explored. Models trained with coarse-grained texts may not be able to learn mappings from fine-grained motion-related words to motion primitives, resulting in the failure to generate motions from unseen descriptions. In this paper, we build a large-scale language-motion dataset specializing in fine-grained textual descriptions, FineHumanML3D, by feeding GPT-3.5-turbo with step-by-step instructions with pseudo-code compulsory checks. Accordingly, we design a new text2motion model, FineMotionDiffuse, making full use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Weight Decay · Layer Normalization · Multi-Head Attention · Cosine Annealing · Softmax · Dropout
