Neural Sign Actors: A diffusion model for 3D sign language production from text
Vasileios Baltatzis, Rolandos Alexandros Potamias, Evangelos Ververas,, Guanxiong Sun, Jiankang Deng, Stefanos Zafeiriou

TL;DR
This paper introduces a diffusion-based model for generating realistic 3D sign language avatars from text, leveraging a novel graph neural network on the SMPL-X skeleton to improve sign language production.
Contribution
The paper presents a new diffusion model for 3D sign language production using a graph neural network on the SMPL-X skeleton, trained on a large-scale dataset, advancing realism and semantic accuracy.
Findings
Outperforms previous SLP methods quantitatively
Produces realistic dynamic 3D sign language sequences
Bridges communication gap for Deaf and hearing communities
Abstract
Sign Languages (SL) serve as the primary mode of communication for the Deaf and Hard of Hearing communities. Deep learning methods for SL recognition and translation have achieved promising results. However, Sign Language Production (SLP) poses a challenge as the generated motions must be realistic and have precise semantic meaning. Most SLP methods rely on 2D data, which hinders their realism. In this work, a diffusion-based SLP model is trained on a curated large-scale dataset of 4D signing avatars and their corresponding text transcripts. The proposed method can generate dynamic sequences of 3D avatars from an unconstrained domain of discourse using a diffusion process formed on a novel and anatomically informed graph neural network defined on the SMPL-X body skeleton. Through quantitative and qualitative experiments, we show that the proposed method considerably outperforms previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Gait Recognition and Analysis
MethodsDiffusion · Graph Neural Network
