Generating Attribute-Aware Human Motions from Textual Prompt

Xinghan Wang; Kun Xu; Fei Li; Cao Sheng; Jiazhong Yu; Yadong Mu

arXiv:2506.21912·cs.CV·November 14, 2025

Generating Attribute-Aware Human Motions from Textual Prompt

Xinghan Wang, Kun Xu, Fei Li, Cao Sheng, Jiazhong Yu, Yadong Mu

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel framework for generating human motions from text that explicitly incorporates human attributes like age and gender, enabling more personalized and accurate motion synthesis.

Contribution

It proposes a new attribute-aware motion generation model based on Structural Causal Models, addressing the gap of attribute influence in text-driven human motion synthesis.

Findings

01

The model effectively decouples action semantics from human attributes.

02

A new dataset with attribute annotations is introduced as a benchmark.

03

Experiments demonstrate the model's ability to generate personalized human motions.

Abstract

Text-driven human motion generation has recently attracted considerable attention, allowing models to generate human motions based on textual descriptions. However, current methods neglect the influence of human attributes-such as age, gender, weight, and height-which are key factors shaping human motion patterns. This work represents a pilot exploration for bridging this gap. We conceptualize each motion as comprising both attribute information and action semantics, where textual descriptions align exclusively with action semantics. To achieve this, a new framework inspired by Structural Causal Models is proposed to decouple action semantics from human attributes, enabling text-to-semantics prediction and attribute-controlled generation. The resulting model is capable of generating attribute-aware motion aligned with the user's text and attribute inputs. For evaluation, we introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Generating Attribute-Aware Human Motions from Textual Prompt· underline

Taxonomy

TopicsHuman Motion and Animation · Multimodal Machine Learning Applications · Social Robot Interaction and HRI