Real-Time Sign Language Gestures to Speech Transcription using Deep Learning

Brandone Fonya; Clarence Worrell

arXiv:2508.12713 · cs.CV · February 24, 2026

TL;DR

This paper presents a real-time, deep-learning-based system that translates sign language gestures into text and speech, easing communication for individuals with hearing and speech impairments.

Contribution

The study introduces a real-time sign language translation system that uses a CNN trained on the Sign Language MNIST dataset and couples gesture recognition with text-to-speech synthesis as a practical communication aid.
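
To make the classification target concrete, here is a minimal sketch of the label-to-letter mapping, assuming the publicly distributed Sign Language MNIST convention (labels 0 to 25 correspond to A to Z, with J and Z absent because those signs involve motion). The helper name `label_to_letter` is hypothetical and does not come from the paper.

```python
import string

# Assumption: labels follow the public Sign Language MNIST convention,
# 0..25 -> A..Z, with J (9) and Z (25) absent because they require motion.
MOTION_LETTERS = {"J", "Z"}

# The 24 static letters a classifier trained on this dataset can output.
STATIC_LETTERS = [c for c in string.ascii_uppercase if c not in MOTION_LETTERS]


def label_to_letter(label: int) -> str:
    """Map a dataset label to its letter (hypothetical helper)."""
    if not 0 <= label <= 25:
        raise ValueError(f"label out of range: {label}")
    letter = string.ascii_uppercase[label]
    if letter in MOTION_LETTERS:
        raise ValueError(f"label {label} ({letter}) is not a static sign")
    return letter
```

Under this convention the classifier distinguishes 24 static hand shapes, so a fingerspelling system built on the dataset alone cannot produce J or Z.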

Findings

01 High accuracy in gesture classification

02 Robust real-time performance, with some latency

03 Effective translation into spoken language

Abstract

Communication barriers pose significant challenges for individuals with hearing and speech impairments, often limiting their ability to interact effectively in everyday environments. This project introduces a real-time assistive technology solution that leverages advanced deep learning techniques to translate sign language gestures into textual and audible speech. By employing convolutional neural networks (CNNs) trained on the Sign Language MNIST dataset, the system accurately classifies hand gestures captured live via webcam. Detected gestures are translated into their corresponding meanings and transcribed into spoken language using text-to-speech synthesis, thus facilitating seamless communication. Comprehensive experiments demonstrate high model accuracy and robust real-time performance with some latency, highlighting the system's practical applicability as an…
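
The abstract does not say how per-frame predictions are turned into the text that gets spoken. One common approach, shown here purely as an assumption and not as the authors' method, is to accept a letter only after several consecutive confident frames agree. The class name `StableLetterEmitter` and the parameters `min_frames` and `min_confidence` are hypothetical; webcam capture, the CNN, and the text-to-speech call are deliberately left out.

```python
from typing import Optional


class StableLetterEmitter:
    """Emit a letter once it has been predicted on enough consecutive frames.

    Hypothetical sketch: thresholds are illustrative, not taken from the paper.
    """

    def __init__(self, min_frames: int = 5, min_confidence: float = 0.8) -> None:
        self.min_frames = min_frames
        self.min_confidence = min_confidence
        self._candidate: Optional[str] = None
        self._streak = 0
        self._emitted = False

    def update(self, letter: str, confidence: float) -> Optional[str]:
        """Feed one per-frame prediction; return a letter when it is accepted."""
        if confidence < self.min_confidence:
            # A low-confidence frame breaks the current streak.
            self._candidate = None
            self._streak = 0
            self._emitted = False
            return None
        if letter != self._candidate:
            # A new candidate starts a fresh streak.
            self._candidate = letter
            self._streak = 1
            self._emitted = False
        else:
            self._streak += 1
        if self._streak >= self.min_frames and not self._emitted:
            # Emit once per streak so a held sign is not repeated.
            self._emitted = True
            return letter
        return None


# Usage: simulated (letter, confidence) pairs standing in for CNN outputs.
emitter = StableLetterEmitter(min_frames=3)
frames = [("H", 0.95)] * 4 + [("I", 0.9)] * 3
text = "".join(out for out in (emitter.update(*f) for f in frames) if out)
# text == "HI"
```

Requiring agreement across frames trades a little latency for fewer spurious letters; the accepted text could then be handed to whatever speech synthesizer the system uses.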

Taxonomy

Topics: Hand Gesture Recognition Systems · Hearing Impairment and Communication · Speech and Dialogue Systems