Temporal superimposed crossover module for effective continuous sign language
Qidan Zhu, Jing Li, Fei Yuan, Quan Gan

TL;DR
This paper introduces a zero-parameter, low-cost temporal superimposed crossover module combined with 2D convolution to enhance real-time continuous sign language recognition, extending ResNet for video recognition with improved efficiency.
Contribution
It proposes a novel zero-parameter temporal module integrated with 2D convolution, enabling effective spatial-temporal modeling in CSLR with reduced computational cost.
Findings
Achieves competitive recognition accuracy on large-scale datasets.
Reduces training memory usage compared to existing methods.
Extends ResNet for end-to-end sign language video recognition.
Abstract
The ultimate goal of continuous sign language recognition(CSLR) is to facilitate the communication between special people and normal people, which requires a certain degree of real-time and deploy-ability of the model. However, in the previous research on CSLR, little attention has been paid to the real-time and deploy-ability. In order to improve the real-time and deploy-ability of the model, this paper proposes a zero parameter, zero computation temporal superposition crossover module(TSCM), and combines it with 2D convolution to form a "TSCM+2D convolution" hybrid convolution, which enables 2D convolution to have strong spatial-temporal modelling capability with zero parameter increase and lower deployment cost compared with other spatial-temporal convolutions. The overall CSLR model based on TSCM is built on the improved ResBlockT network in this paper. The hybrid convolution of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Gait Recognition and Analysis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · 1x1 Convolution · Max Pooling · Kaiming Initialization · Global Average Pooling · Residual Block · Batch Normalization · Residual Connection · Bottleneck Residual Block
