DiG-Net: Enhancing Human-Robot Interaction through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics

Eran Bamani Beeri; Eden Nissinman; Avishai Sintov

arXiv:2505.24786·cs.RO·March 17, 2026

DiG-Net: Enhancing Human-Robot Interaction through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics

Eran Bamani Beeri, Eden Nissinman, Avishai Sintov

PDF

TL;DR

DiG-Net is a novel gesture recognition framework that enables accurate, robust detection of dynamic hand gestures at hyper-range distances up to 30 meters, significantly improving assistive human-robot interaction in challenging environments.

Contribution

This paper introduces DiG-Net, the first framework capable of recognizing dynamic gestures at hyper-range distances, combining novel DADA blocks, Spatio-Temporal Graph modules, and a new loss function for robustness.

Findings

01

Achieved 97.3% recognition accuracy on a challenging dataset.

02

Demonstrated robustness under conditions of physical attenuation and low resolution.

03

Enabled effective gesture recognition up to 30 meters distance.

Abstract

Dynamic hand gestures play a pivotal role in assistive human-robot interaction (HRI), facilitating intuitive, non-verbal communication, particularly for individuals with mobility constraints or those operating robots remotely. Current gesture recognition methods are mostly limited to short-range interactions, reducing their utility in scenarios demanding robust assistive communication from afar. In this paper, we present DiG-Net, the first dynamic gesture recognition framework enabling robust operation at hyper-range distances of up to 30 meters, specifically designed for assistive robotics to enhance accessibility and improve quality of life. Our proposed Distance-aware Gesture Network (DiG-Net) effectively combines Depth-Conditioned Deformable Alignment (DADA) blocks with Spatio-Temporal Graph modules, enabling robust processing and classification of gesture sequences captured under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.