A Transformer-Based Multi-Stream Approach for Isolated Iranian Sign Language Recognition
Ali Ghadami, Alireza Taheri, Ali Meghdari

TL;DR
This paper presents a transformer-based multi-stream model for recognizing Iranian Sign Language, achieving over 90% accuracy and enabling real-time feedback to improve sign language learning and communication for the deaf community.
Contribution
It introduces a novel multi-stream transformer approach with genetic algorithm optimization and multi-task learning for Iranian Sign Language recognition.
Findings
Achieved 90.2% accuracy on test data.
Demonstrated effective real-time sign language recognition.
Showed positive impact of feedback in sign language learning software.
Abstract
Sign language is an essential means of communication for millions of people around the world and serves as their primary language. However, most communication tools are developed for spoken and written languages which can cause problems and difficulties for the deaf and hard of hearing community. By developing a sign language recognition system, we can bridge this communication gap and enable people who use sign language as their main form of expression to better communicate with people and their surroundings. This recognition system increases the quality of health services, improves public services, and creates equal opportunities for the deaf community. This research aims to recognize Iranian Sign Language words with the help of the latest deep learning tools such as transformers. The dataset used includes 101 Iranian Sign Language words frequently used in academic environments such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Gait Recognition and Analysis
