A$^{2}$V-SLP: Alignment-Aware Variational Modeling for Disentangled Sign Language Production
S\"umeyye Meryem Ta\c{s}y\"urek, Enis M\"ucahid \.Iskender, Hacer Yalim Keles

TL;DR
This paper introduces A$^{2}$V-SLP, a novel alignment-aware variational framework for sign language production that learns disentangled, articulator-specific latent representations to improve motion realism and translation accuracy.
Contribution
It proposes a variational autoencoder with articulator-wise disentangled latent distributions and a gloss attention mechanism for enhanced sign language generation.
Findings
Achieves state-of-the-art back-translation performance
Improves motion realism in sign language synthesis
Outperforms deterministic latent regression methods
Abstract
Building upon recent structural disentanglement frameworks for sign language production, we propose AV-SLP, an alignment-aware variational framework that learns articulator-wise disentangled latent distributions rather than deterministic embeddings. A disentangled Variational Autoencoder (VAE) encodes ground-truth sign pose sequences and extracts articulator-specific mean and variance vectors, which are used as distributional supervision for training a non-autoregressive Transformer. Given text embeddings, the Transformer predicts both latent means and log-variances, while the VAE decoder reconstructs the final sign pose sequences through stochastic sampling at the decoding stage. This formulation maintains articulator-level representations by avoiding deterministic latent collapse through distributional latent modeling. In addition, we integrate a gloss attention mechanism to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Hearing Impairment and Communication
