Event Stream-based Sign Language Translation: A High-Definition Benchmark Dataset and A Novel Baseline
Shiao Wang, Xiao Wang, Duoqing Yang, Yao Rong, Fuling Wang, Jianing Li, Lin Zhu, Bo Jiang

TL;DR
This paper introduces a high-definition event-based sign language dataset and a novel translation framework, addressing limitations of traditional video-based methods and advancing AI-assisted sign language translation.
Contribution
It presents a new event-based sign language dataset, Event-CSL, and a novel translation model, EvSLT, improving robustness and performance in sign language translation tasks.
Findings
Event-CSL dataset contains 14,827 videos and covers diverse conditions.
EvSLT outperforms existing methods on multiple datasets.
The approach enhances translation accuracy and robustness.
Abstract
Sign Language Translation (SLT) is a core task in the field of AI-assisted disability. Traditional SLT methods are typically based on visible light videos, which are easily affected by factors such as lighting variations, rapid hand movements, and privacy concerns. This paper proposes the use of bio-inspired event cameras to alleviate the aforementioned issues. Specifically, we introduce a new high-definition event-based sign language dataset, termed Event-CSL, which effectively addresses the data scarcity in this research area. The dataset comprises 14,827 videos, 14,821 glosses, and 2,544 Chinese words in the text vocabulary. These samples are collected across diverse indoor and outdoor scenes, covering multiple viewpoints, lighting conditions, and camera motions. We have also benchmarked existing mainstream SLT methods on this dataset to facilitate fair comparisons in future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Robotics and Automated Systems · Anomaly Detection Techniques and Applications
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
