Classifying In-Place Gestures with End-to-End Point Cloud Learning
Lizhi Zhao, Xuequan Lu, Min Zhao, Meili Wang

TL;DR
This paper introduces a novel end-to-end point cloud learning framework for accurately classifying in-place gestures in virtual environments, achieving high accuracy and low latency.
Contribution
It proposes a new point cloud-based neural network framework that combines supervised learning with unsupervised domain adaptation for gesture classification.
Findings
Achieved 95% classification accuracy.
Real-time performance with 192ms latency.
Effective handling of inter-person variations.
Abstract
Walking in place for moving through virtual environments has attracted noticeable attention recently. Recent attempts focused on training a classifier to recognize certain patterns of gestures (e.g., standing, walking, etc) with the use of neural networks like CNN or LSTM. Nevertheless, they often consider very few types of gestures and/or induce less desired latency in virtual environments. In this paper, we propose a novel framework for accurate and efficient classification of in-place gestures. Our key idea is to treat several consecutive frames as a "point cloud". The HMD and two VIVE trackers provide three points in each frame, with each point consisting of 12-dimensional features (i.e., three-dimensional position coordinates, velocity, rotation, angular velocity). We create a dataset consisting of 9 gesture classes for virtual in-place locomotion. In addition to the supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Hand Gesture Recognition Systems
