Snapture -- A Novel Neural Architecture for Combined Static and Dynamic Hand Gesture Recognition
Hassan Ali, Doreen Jirak, Stefan Wermter

TL;DR
This paper introduces Snapture, a hybrid neural architecture that effectively recognizes both static and dynamic hand gestures by capturing snapshots and analyzing motion profiles, enhancing human-robot interaction capabilities.
Contribution
The novel architecture combines static and dynamic gesture recognition with motion analysis, enabling improved performance and modular integration of multimodal cues in HRI scenarios.
Findings
Outperforms CNNLSTM baseline on two gesture benchmarks.
Analyzes gesture motion profiles to improve recognition accuracy.
Modular design allows integration of facial expressions and head tracking.
Abstract
As robots are expected to get more involved in people's everyday lives, frameworks that enable intuitive user interfaces are in demand. Hand gesture recognition systems provide a natural way of communication and, thus, are an integral part of seamless Human-Robot Interaction (HRI). Recent years have witnessed an immense evolution of computational models powered by deep learning. However, state-of-the-art models fall short in expanding across different gesture domains, such as emblems and co-speech. In this paper, we propose a novel hybrid hand gesture recognition system. Our architecture enables learning both static and dynamic gestures: by capturing a so-called "snapshot" of the gesture performance at its peak, we integrate the hand pose along with the dynamic movement. Moreover, we present a method for analyzing the motion profile of a gesture to uncover its dynamic characteristics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Robot Manipulation and Learning
