Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
Zeyi Zhang, Tenglong Ao, Yuyao Zhang, Qingzhe Gao, Chuan Lin, Baoquan Chen, Libin Liu

TL;DR
Semantic Gesticulator introduces a novel framework that synthesizes semantically meaningful gestures aligned with speech by combining large language models, a motion library, and a semantic alignment mechanism, improving naturalness and semantic accuracy.
Contribution
The paper presents a new generative retrieval framework using a GPT-based model and a semantic alignment mechanism for more accurate and natural co-speech gesture synthesis.
Findings
Outperforms state-of-the-art methods in semantic appropriateness
Generates rhythmically coherent gestures
Produces robust, natural-looking gestures
Abstract
In this work, we present Semantic Gesticulator, a novel framework designed to synthesize realistic gestures accompanying speech with strong semantic correspondence. Semantically meaningful gestures are crucial for effective non-verbal communication, but such gestures often fall within the long tail of the distribution of natural human motion. The sparsity of these movements makes it challenging for deep learning-based systems, trained on moderately sized datasets, to capture the relationship between the movements and the corresponding speech semantics. To address this challenge, we develop a generative retrieval framework based on a large language model. This framework efficiently retrieves suitable semantic gesture candidates from a motion library in response to the input speech. To construct this motion library, we summarize a comprehensive list of commonly used semantic gestures…
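The retrieval step described in the abstract can be illustrated with a toy sketch. This is not the authors' method: the paper uses a large language model for generative retrieval, while the code below substitutes a plain keyword lookup so that it stays self-contained. The names `GestureEntry`, `MOTION_LIBRARY`, and `retrieve_candidates`, along with the gesture labels and keywords, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GestureEntry:
    """Hypothetical library entry: a gesture label plus trigger keywords."""
    label: str
    keywords: frozenset


# Hypothetical, hand-written library; the paper's actual gesture list
# is not reproduced here.
MOTION_LIBRARY = [
    GestureEntry("wave_hello", frozenset({"hello", "hi", "goodbye"})),
    GestureEntry("thumbs_up", frozenset({"great", "good", "agree"})),
    GestureEntry("shrug", frozenset({"unsure", "maybe", "dunno"})),
]


def retrieve_candidates(transcript, library=MOTION_LIBRARY):
    """Return (word_index, gesture_label) pairs for matching words.

    Assumption: keyword lookup stands in for the LLM-based retrieval
    that the abstract describes.
    """
    candidates = []
    for index, raw_word in enumerate(transcript.lower().split()):
        word = raw_word.strip(".,!?;:")  # drop surrounding punctuation
        for entry in library:
            if word in entry.keywords:
                candidates.append((index, entry.label))
    return candidates


if __name__ == "__main__":
    print(retrieve_candidates("Hello, that sounds great!"))
```

Each returned pair ties a gesture candidate to a word position in the transcript. In a fuller pipeline, that position is what a later alignment stage would presumably use to place the gesture in time with the speech.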
Taxonomy
Topics: Speech and dialogue systems · Robotics and Automated Systems · Hand Gesture Recognition Systems
Methods: Lib · ALIGN
