EfficientSign: An Attention-Enhanced Lightweight Architecture for Indian Sign Language Recognition
Rishabh Gupta, Shravya R. Nalla

TL;DR
EfficientSign is a lightweight, attention-enhanced model based on EfficientNet-B0 that achieves near state-of-the-art accuracy in Indian Sign Language recognition with significantly fewer parameters, suitable for mobile deployment.
Contribution
The paper introduces EfficientSign, a novel lightweight architecture with attention modules that matches high accuracy while reducing model size for sign language recognition.
Findings
EfficientSign achieves 99.94% accuracy, comparable to ResNet18.
It uses 62% fewer parameters than ResNet18.
Classical classifiers with deep features outperform previous methods.
Abstract
How do you build a sign language recognizer that works on a phone? That question drove this work. We built EfficientSign, a lightweight model which takes EfficientNet-B0 and focuses on two attention modules (Squeeze-and-Excitation for channel focus, and a spatial attention layer that focuses on the hand gestures). We tested it against five other approaches on 12,637 images of Indian Sign Language alphabets, all 26 classes, using 5-fold cross-validation. EfficientSign achieves the accuracy of 99.94% (+/-0.05%), which matches the performance of ResNet18's 99.97% accuracy, but with 62% fewer parameters (4.2M vs 11.2M). We also experimented with feeding deep features (1,280-dimensional vectors pulled from EfficientNet-B0's pooling layer) into classical classifiers. SVM achieved the accuracy of 99.63%, Logistic Regression achieved the accuracy of 99.03% and KNN achieved accuracy of 96.33%.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
