EvSign: Sign Language Recognition and Translation with Streaming Events
Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia

TL;DR
This paper introduces EvSign, a new event camera-based dataset and a transformer framework for continuous sign language recognition and translation, demonstrating improved efficiency and performance over existing methods.
Contribution
The work presents the first event-based sign language dataset and a novel transformer model optimized for streaming event data in sign language tasks.
Findings
The EvSign dataset provides extensive high-quality event streams with annotations.
The proposed model achieves competitive accuracy with significantly lower computational cost.
In experiments, the proposed model outperforms state-of-the-art methods on multiple datasets.
Abstract
Sign language is one of the most effective communication tools for people with hearing difficulties. Most existing works focus on improving the performance of sign language tasks on RGB videos, which may suffer from degraded recording conditions, such as fast movement of hands with motion blur and textured signer's appearance. The bio-inspired event camera, which asynchronously captures brightness change with high speed, could naturally perceive dynamic hand movements, providing rich manual clues for sign language tasks. In this work, we aim at exploring the potential of event camera in continuous sign language recognition (CSLR) and sign language translation (SLT). To promote the research, we first collect an event-based benchmark EvSign for those tasks with both gloss and spoken language annotations. EvSign dataset offers a substantial amount of high-quality event streams and an…
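To make the idea of asynchronous brightness-change events more concrete, the following is a minimal sketch of one common way to turn an event stream into frame-like arrays by binning events over time. It is not the authors' pipeline: the event layout `(x, y, timestamp, polarity)`, the function name `events_to_frames`, and the fixed number of time bins are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical event record (an assumption, not the EvSign file format):
# (x, y, timestamp_us, polarity), where polarity is +1 or -1.
Event = Tuple[int, int, int, int]


def events_to_frames(events: List[Event], width: int, height: int,
                     num_bins: int) -> List[List[List[int]]]:
    """Accumulate events into num_bins signed count frames (height x width)."""
    frames = [[[0] * width for _ in range(height)] for _ in range(num_bins)]
    if not events:
        return frames
    t_start = min(e[2] for e in events)
    t_end = max(e[2] for e in events)
    span = max(t_end - t_start, 1)  # avoid division by zero for one timestamp
    for x, y, t, p in events:
        # Map the timestamp to a bin; the latest event goes in the final bin.
        b = min((t - t_start) * num_bins // span, num_bins - 1)
        frames[b][y][x] += p
    return frames


# Four toy events on a 2x2 sensor, split into two time bins.
events = [(0, 0, 0, 1), (1, 0, 10, -1), (1, 1, 90, 1), (1, 1, 100, 1)]
frames = events_to_frames(events, width=2, height=2, num_bins=2)
```

Each frame holds the signed sum of polarities per pixel within its time window, so pixels with no brightness change stay at zero. A fixed bin count is only one possible choice; the binning scheme used in the paper may differ.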
Taxonomy
Topics: Hand Gesture Recognition Systems · Hearing Impairment and Communication
Methods: Focus · Surrogate Lagrangian Relaxation
