Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production

Shengeng Tang; Jiayi He; Dan Guo; Yanyan Wei; Feng Li; Richang Hong

arXiv:2412.13609·cs.CV·December 20, 2024

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production

Shengeng Tang, Jiayi He, Dan Guo, Yanyan Wei, Feng Li, Richang Hong

PDF

Open Access 1 Repo 1 Video

TL;DR

Sign-IDD introduces a novel diffusion-based framework for sign language production that disentangles iconicity by modeling bones and attributes, leading to more accurate and natural sign pose generation from text.

Contribution

The paper proposes a pioneering Iconicity Disentanglement module and an Attribute Controllable Diffusion approach for improved sign pose synthesis in sign language production.

Findings

01

Effective in generating natural sign poses

02

Outperforms existing G2P methods on benchmark datasets

03

Enhances pose accuracy and consistency

Abstract

Sign Language Production (SLP) aims to generate semantically consistent sign videos from textual statements, where the conversion from textual glosses to sign poses (G2P) is a crucial step. Existing G2P methods typically treat sign poses as discrete three-dimensional coordinates and directly fit them, which overlooks the relative positional relationships among joints. To this end, we provide a new perspective, constraining joint associations and gesture details by modeling the limb bones to improve the accuracy and naturalness of the generated poses. In this work, we propose a pioneering iconicity disentangled diffusion framework, termed Sign-IDD, specifically designed for SLP. Sign-IDD incorporates a novel Iconicity Disentanglement (ID) module to bridge the gap between relative positions among joints. The ID module disentangles the conventional 3D joint representation into a 4D bone…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

navi-start/sign-idd
pytorchOfficial

Videos

Sign-IDD: Iconicity Disentangled Diffusion for Sign Language Production· underline

Taxonomy

TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication

MethodsDiffusion