Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Benjia Zhou, Yunan Li, Jun Wan

TL;DR
This paper introduces RAAR3DNet, a novel neural network for RGB-D gesture recognition that adaptively focuses on hand/arm regions using a neural architecture search and a dynamic-static attention module, outperforming existing methods.
Contribution
It proposes an architecture-rebuilt 3D network with a regional attention module, improving gesture recognition by focusing on relevant regions and adaptively learning feature representations.
Findings
Outperforms state-of-the-art gesture recognition methods
Effectively highlights hand/arm regions and motion information
Validated on large-scale RGB-D datasets
Abstract
Human gesture recognition has drawn much attention in the area of computer vision. However, the performance of gesture recognition is always influenced by some gesture-irrelevant factors like the background and the clothes of performers. Therefore, focusing on the regions of hand/arm is important to the gesture recognition. Meanwhile, a more adaptive architecture-searched network structure can also perform better than the block-fixed ones like Resnet since it increases the diversity of features in different stages of the network better. In this paper, we propose a regional attention with architecture-rebuilt 3D network (RAAR3DNet) for gesture recognition. We replace the fixed Inception modules with the automatically rebuilt structure through the network via Neural Architecture Search (NAS), owing to the different shape and representation ability of features in the early, middle, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Gait Recognition and Analysis
Methods1x1 Convolution · Average Pooling · Kaiming Initialization · Batch Normalization · Max Pooling · Residual Connection · Global Average Pooling · Bottleneck Residual Block · Convolution · *Communicated@Fast*How Do I Communicate to Expedia?
