Language-Guided Transformer Tokenizer for Human Motion Generation
Sheng Yan, Yong Wang, Xin Du, Junsong Yuan, Mengyuan Liu

TL;DR
This paper introduces LG-Tok, a Transformer-based, language-guided motion tokenizer that produces compact, semantic motion representations, improving generation quality and efficiency in human motion synthesis tasks.
Contribution
We propose a novel language-guided motion tokenization method using Transformer architecture, enhancing motion representation and generative model learning efficiency.
Findings
LG-Tok outperforms state-of-the-art methods on HumanML3D and Motion-X benchmarks.
LG-Tok-mini achieves comparable performance with fewer tokens.
The language-drop scheme enables language-free motion generation.
Abstract
In this paper, we focus on motion discrete tokenization, which converts raw motion into compact discrete tokens--a process proven crucial for efficient motion generation. In this paradigm, increasing the number of tokens is a common approach to improving motion reconstruction quality, but more tokens make it more difficult for generative models to learn. To maintain high reconstruction quality while reducing generation complexity, we propose leveraging language to achieve efficient motion tokenization, which we term Language-Guided Tokenization (LG-Tok). LG-Tok aligns natural language with motion at the tokenization stage, yielding compact, high-level semantic representations. This approach not only strengthens both tokenization and detokenization but also simplifies the learning of generative models. Furthermore, existing tokenizers predominantly adopt convolutional architectures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
