iSign: A Benchmark for Indian Sign Language Processing
Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha, Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi

TL;DR
iSign introduces a comprehensive benchmark for Indian Sign Language processing, including a large dataset, multiple NLP tasks, and baseline models, to advance research in this under-resourced area.
Contribution
The paper releases the largest ISL-English dataset, proposes multiple NLP tasks, and provides baseline models and linguistic insights for Indian Sign Language processing.
Findings
Largest ISL dataset with 118K video-sentence pairs
Benchmarking of multiple NLP tasks for ISL
Baseline models and linguistic analysis provided
Abstract
Indian Sign Language has limited resources for developing machine learning and data-driven approaches for automated language processing. Though text/audio-based language processing techniques have shown colossal research interest and tremendous improvements in the last few years, Sign Languages still need to catch up due to the need for more resources. To bridge this gap, in this work, we propose iSign: a benchmark for Indian Sign Language (ISL) Processing. We make three primary contributions to this work. First, we release one of the largest ISL-English datasets with more than 118K video-sentence/phrase pairs. To the best of our knowledge, it is the largest sign language dataset available for ISL. Second, we propose multiple NLP-specific tasks (including SignVideo2Text, SignPose2Text, Text2Pose, Word Prediction, and Sign Semantics) and benchmark them with the baseline models for easier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication
