iSign: A Benchmark for Indian Sign Language Processing

Abhinav Joshi; Romit Mohanty; Mounika Kanakanti; Andesha Mangla; Sudeep Choudhary; Monali Barbate; Ashutosh Modi

arXiv:2407.05404 · cs.CL · July 9, 2024

TL;DR

iSign introduces a comprehensive benchmark for Indian Sign Language processing, including a large dataset, multiple NLP tasks, and baseline models, to advance research in this under-resourced area.

Contribution

The paper releases the largest ISL-English dataset, proposes multiple NLP tasks, and provides baseline models and linguistic insights for Indian Sign Language processing.

Findings

1. Largest ISL dataset with 118K video-sentence pairs
2. Benchmarking of multiple NLP tasks for ISL
3. Baseline models and linguistic analysis provided

Abstract

Indian Sign Language has limited resources for developing machine learning and data-driven approaches for automated language processing. Though text/audio-based language processing techniques have shown colossal research interest and tremendous improvements in the last few years, Sign Languages still need to catch up due to the need for more resources. To bridge this gap, in this work, we propose iSign: a benchmark for Indian Sign Language (ISL) Processing. We make three primary contributions to this work. First, we release one of the largest ISL-English datasets with more than 118K video-sentence/phrase pairs. To the best of our knowledge, it is the largest sign language dataset available for ISL. Second, we propose multiple NLP-specific tasks (including SignVideo2Text, SignPose2Text, Text2Pose, Word Prediction, and Sign Semantics) and benchmark them with the baseline models for easier…

Code & Models

Datasets

Exploration-Lab/iSign
dataset · 315 downloads

Videos

iSign: A Benchmark for Indian Sign Language Processing · underline

Taxonomy

Topics: Hand Gesture Recognition Systems · Hearing Impairment and Communication