Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation
Carlos Eduardo G. R. Alves, Francisco de Assis Boldt, Thiago M., Paix\~ao

TL;DR
This paper introduces a novel approach for Brazilian Sign Language recognition by converting landmark data into images processed by CNNs, achieving superior accuracy and efficiency over existing methods.
Contribution
The study presents a new skeleton image representation for ISLR that improves accuracy, reduces training complexity, and enhances time efficiency using only RGB data.
Findings
Outperformed state-of-the-art on LIBRAS datasets
Achieved higher accuracy with simpler network architecture
Reduced training time and complexity
Abstract
Effective communication is paramount for the inclusion of deaf individuals in society. However, persistent communication barriers due to limited Sign Language (SL) knowledge hinder their full participation. In this context, Sign Language Recognition (SLR) systems have been developed to improve communication between signing and non-signing individuals. In particular, there is the problem of recognizing isolated signs (Isolated Sign Language Recognition, ISLR) of great relevance in the development of vision-based SL search engines, learning tools, and translation systems. This work proposes an ISLR approach where body, hands, and facial landmarks are extracted throughout time and encoded as 2-D images. These images are processed by a convolutional neural network, which maps the visual-temporal information into a sign label. Experimental results demonstrate that our method surpassed the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Gait Recognition and Analysis
MethodsFocus
