Pose-Guided Fine-Grained Sign Language Video Generation

Tongkai Shi; Lianyu Hu; Fanhua Shang; Jichao Feng; Peidong Liu; Wei; Feng

arXiv:2409.16709·cs.CV·September 26, 2024

Pose-Guided Fine-Grained Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang, Jichao Feng, Peidong Liu, Wei, Feng

PDF

Open Access

TL;DR

This paper introduces a novel pose-guided motion model for generating high-quality, temporally consistent sign language videos, addressing issues of detail distortion and flickering present in previous methods.

Contribution

The paper presents a new Coarse Motion Module and Pose Fusion Module that improve detail accuracy and temporal consistency in sign language video synthesis.

Findings

01

Outperforms state-of-the-art methods in benchmark tests.

02

Shows significant improvements in detail preservation.

03

Achieves better temporal consistency in generated videos.

Abstract

Sign language videos are an important medium for spreading and learning sign language. However, most existing human image synthesis methods produce sign language images with details that are distorted, blurred, or structurally incorrect. They also produce sign language video frames with poor temporal consistency, with anomalies such as flickering and abrupt detail changes between the previous and next frames. To address these limitations, we propose a novel Pose-Guided Motion Model (PGMM) for generating fine-grained and motion-consistent sign language videos. Firstly, we propose a new Coarse Motion Module (CMM), which completes the deformation of features by optical flow warping, thus transfering the motion of coarse-grained structures without changing the appearance; Secondly, we propose a new Pose Fusion Module (PFM), which guides the modal fusion of RGB and pose features, thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Human Motion and Animation