Reconnaissance Automatique des Langues des Signes : Une Approche Hybridée CNN-LSTM Basée sur Mediapipe (Automatic Sign Language Recognition: A Hybrid CNN-LSTM Approach Based on Mediapipe)

Fraisse Sacré Takouchouang; Ho Tuong Vinh

arXiv:2510.22011·cs.CV·October 28, 2025

TL;DR

This paper presents a real-time automatic sign language recognition system using a hybrid CNN-LSTM model and Mediapipe, achieving 92% accuracy and aiding communication for deaf communities.

Contribution

It introduces a novel hybrid CNN-LSTM architecture combined with Mediapipe for gesture keypoint extraction, enabling real-time sign language translation.
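A sequence model such as an LSTM consumes a fixed-length run of per-frame feature vectors, so a real-time pipeline needs a buffering step between keypoint extraction and the classifier. The following is a minimal sketch of that step, not the authors' implementation. It assumes one hand with 21 landmarks of (x, y, z) each, as produced by MediaPipe Hands; the window length of 30 frames and the names `flatten_landmarks` and `FrameWindow` are hypothetical.

```python
from collections import deque
from typing import Deque, List, Optional, Sequence

NUM_LANDMARKS = 21  # assumption: a single hand in the MediaPipe Hands layout
COORDS = 3          # x, y, z per landmark
FEATURES = NUM_LANDMARKS * COORDS
WINDOW = 30         # hypothetical number of frames per gesture sequence

Landmarks = Sequence[Sequence[float]]


def flatten_landmarks(landmarks: Optional[Landmarks]) -> List[float]:
    """Flatten [(x, y, z), ...] into one feature vector.

    Returns a zero vector when no hand was detected in the frame, so the
    sequence keeps a constant shape.
    """
    if landmarks is None:
        return [0.0] * FEATURES
    if len(landmarks) != NUM_LANDMARKS:
        raise ValueError(f"expected {NUM_LANDMARKS} landmarks, got {len(landmarks)}")
    vector: List[float] = []
    for point in landmarks:
        if len(point) != COORDS:
            raise ValueError(f"expected {COORDS} coordinates per landmark")
        vector.extend(float(value) for value in point)
    return vector


class FrameWindow:
    """Keeps the most recent `size` feature vectors for a sequence model."""

    def __init__(self, size: int = WINDOW) -> None:
        self.size = size
        self._frames: Deque[List[float]] = deque(maxlen=size)

    def push(self, landmarks: Optional[Landmarks]) -> Optional[List[List[float]]]:
        """Add one frame; return the full window once enough frames exist."""
        self._frames.append(flatten_landmarks(landmarks))
        if len(self._frames) < self.size:
            return None
        return [list(frame) for frame in self._frames]
```

Each call to `push` would take the landmarks detected in one video frame. Once the window is full, the returned list has shape (window, 63) and could be passed to a classifier; older frames drop out automatically as new ones arrive.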

Findings

01

Achieved 92% average accuracy on sign language gestures.

02

Performed well on distinct gestures like 'Hello' and 'Thank you'.

03

Some confusion remains for visually similar gestures such as 'Call' and 'Yes'.
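Findings of this kind are typically read off per-class scores and a tally of misclassifications. The sketch below shows one way to compute them using only the standard library. The labels are made-up toy data, not the paper's results, and the function names are hypothetical. It also assumes that "average accuracy" means the mean of per-class accuracies, which the summary does not specify.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple


def per_class_accuracy(y_true: List[str], y_pred: List[str]) -> Dict[str, float]:
    """Fraction of samples of each true class that were predicted correctly."""
    totals = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: correct[label] / totals[label] for label in totals}


def most_confused_pair(
    y_true: List[str], y_pred: List[str]
) -> Optional[Tuple[str, str, int]]:
    """Return (true label, predicted label, count) for the most frequent error."""
    errors = Counter((t, p) for t, p in zip(y_true, y_pred) if t != p)
    if not errors:
        return None
    (true_label, predicted_label), count = errors.most_common(1)[0]
    return true_label, predicted_label, count


# Made-up toy labels for illustration only (not data from the paper).
y_true = ["Hello"] * 4 + ["Call"] * 4 + ["Yes"] * 4
y_pred = ["Hello"] * 4 + ["Call", "Call", "Yes", "Yes"] + ["Yes", "Yes", "Yes", "Call"]

scores = per_class_accuracy(y_true, y_pred)
average = sum(scores.values()) / len(scores)  # assumption: unweighted class mean
print(scores)
print(round(average, 2))
print(most_confused_pair(y_true, y_pred))
```

On the toy data, the distinct gesture scores higher than the two similar ones, and the most frequent error is a "Call" sample predicted as "Yes", which mirrors the pattern the findings describe.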

Abstract

Sign languages play a crucial role in the communication of deaf communities, but they are often marginalized, which limits access to essential services such as healthcare and education. This study proposes an automatic sign language recognition system based on a hybrid CNN-LSTM architecture, using Mediapipe for gesture keypoint extraction. Developed with Python, TensorFlow and Streamlit, the system provides real-time gesture translation. The results show an average accuracy of 92%, with very good performance for distinct gestures such as "Hello" and "Thank you". However, some confusion remains between visually similar gestures, such as "Call" and "Yes". This work opens up promising prospects for applications in fields such as healthcare, education and public services.

Taxonomy

Topics: Hand Gesture Recognition Systems · Hearing Impairment and Communication · Interactive and Immersive Displays