In-Place Gestures Classification via Long-term Memory Augmented Network
Lizhi Zhao, Xuequan Lu, Qianyue Bao, Meili Wang

TL;DR
This paper introduces a long-term memory augmented network for in-place gesture classification that leverages both short-term and long-term sequence data to improve accuracy and real-time performance in virtual locomotion control.
Contribution
It proposes a novel memory-augmented neural network that incorporates long-term sequence features during training and inference for enhanced gesture recognition accuracy.
Findings
Achieved 95.1% accuracy with 192ms latency.
Achieved 97.3% accuracy with 312ms latency.
Outperformed recent gesture classification methods.
Abstract
In-place gesture-based virtual locomotion techniques enable users to control their viewpoint and intuitively move in the 3D virtual environment. A key research problem is to accurately and quickly recognize in-place gestures, since they can trigger specific movements of virtual viewpoints and enhance user experience. However, to achieve real-time experience, only short-term sensor sequence data (up to about 300ms, 6 to 10 frames) can be taken as input, which actually affects the classification performance due to limited spatio-temporal information. In this paper, we propose a novel long-term memory augmented network for in-place gestures classification. It takes as input both short-term gesture sequence samples and their corresponding long-term sequence samples that provide extra relevant spatio-temporal information in the training phase. We store long-term sequence features with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Human Motion and Animation
