DiG-Net: Enhancing Human-Robot Interaction through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics
Eran Bamani Beeri, Eden Nissinman, Avishai Sintov

TL;DR
DiG-Net is a novel gesture recognition framework that enables accurate, robust detection of dynamic hand gestures at hyper-range distances up to 30 meters, significantly improving assistive human-robot interaction in challenging environments.
Contribution
This paper introduces DiG-Net, the first framework capable of recognizing dynamic gestures at hyper-range distances, combining novel DADA blocks, Spatio-Temporal Graph modules, and a new loss function for robustness.
Findings
Achieved 97.3% recognition accuracy on a challenging dataset.
Demonstrated robustness under conditions of physical attenuation and low resolution.
Enabled effective gesture recognition up to 30 meters distance.
Abstract
Dynamic hand gestures play a pivotal role in assistive human-robot interaction (HRI), facilitating intuitive, non-verbal communication, particularly for individuals with mobility constraints or those operating robots remotely. Current gesture recognition methods are mostly limited to short-range interactions, reducing their utility in scenarios demanding robust assistive communication from afar. In this paper, we present DiG-Net, the first dynamic gesture recognition framework enabling robust operation at hyper-range distances of up to 30 meters, specifically designed for assistive robotics to enhance accessibility and improve quality of life. Our proposed Distance-aware Gesture Network (DiG-Net) effectively combines Depth-Conditioned Deformable Alignment (DADA) blocks with Spatio-Temporal Graph modules, enabling robust processing and classification of gesture sequences captured under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
