Fast Learning of Dynamic Hand Gesture Recognition with Few-Shot Learning Models
Niels Schl\"usener, Michael B\"ucker

TL;DR
This paper presents a Few-Shot Learning approach using Relation Networks with LSTM for rapid dynamic hand gesture recognition, achieving high accuracy with minimal training examples and significantly reducing labeling effort.
Contribution
It introduces a novel Few-Shot Learning model for dynamic hand gesture recognition that requires fewer labeled examples, demonstrating substantial efficiency improvements over traditional methods.
Findings
Achieved up to 88.8% accuracy for five gestures
Achieved up to 81.2% accuracy for ten gestures
Potential savings of up to 1260 observations in training data
Abstract
We develop Few-Shot Learning models trained to recognize five or ten different dynamic hand gestures, respectively, which are arbitrarily interchangeable by providing the model with one, two, or five examples per hand gesture. All models were built in the Few-Shot Learning architecture of the Relation Network (RN), in which Long-Short-Term Memory cells form the backbone. The models use hand reference points extracted from RGB-video sequences of the Jester dataset which was modified to contain 190 different types of hand gestures. Result show accuracy of up to 88.8% for recognition of five and up to 81.2% for ten dynamic hand gestures. The research also sheds light on the potential effort savings of using a Few-Shot Learning approach instead of a traditional Deep Learning approach to detect dynamic hand gestures. Savings were defined as the number of additional observations required when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication
