AUTSL: A Large Scale Multi-modal Turkish Sign Language Dataset and Baseline Methods
Ozge Mercanoglu Sincan, Hacer Yalim Keles

TL;DR
This paper introduces AUTSL, a large-scale multi-modal Turkish Sign Language dataset with diverse samples and baseline models, advancing sign language recognition research in real-world settings.
Contribution
The paper presents AUTSL, a comprehensive Turkish Sign Language dataset with multi-modal data and baseline deep learning models for performance benchmarking.
Findings
Baseline models achieved up to 95.95% accuracy on AUTSL.
Models achieved 96.11% accuracy on Montalbano dataset.
Performance gaps highlight challenges in real-world sign language recognition.
Abstract
Sign language recognition is a challenging problem where signs are identified by simultaneous local and global articulations of multiple sources, i.e. hand shape and orientation, hand movements, body posture, and facial expressions. Solving this problem computationally for a large vocabulary of signs in real life settings is still a challenge, even with the state-of-the-art models. In this study, we present a new largescale multi-modal Turkish Sign Language dataset (AUTSL) with a benchmark and provide baseline models for performance evaluations. Our dataset consists of 226 signs performed by 43 different signers and 38,336 isolated sign video samples in total. Samples contain a wide variety of backgrounds recorded in indoor and outdoor environments. Moreover, spatial positions and the postures of signers also vary in the recordings. Each sample is recorded with Microsoft Kinect v2 and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
