Synthetic Video Generation for Robust Hand Gesture Recognition in Augmented Reality Applications
Varun Jain, Shivam Aggarwal, Suril Mehta, Ramya Hebbalaguppe

TL;DR
This paper introduces a framework for generating synthetic, labeled, photo-realistic videos of hand gestures to improve training and benchmarking of gesture recognition models in AR/VR, reducing dependence on costly sensors.
Contribution
The authors present a novel framework for creating diverse, labeled synthetic videos of hand gestures to aid in training and benchmarking AR/VR gesture recognition systems.
Findings
Generated videos are diverse and photo-realistic.
Framework effectively provides labeled data for model training.
Synthetic data improves gesture recognition performance.
Abstract
Hand gestures are a natural means of interaction in Augmented Reality and Virtual Reality (AR/VR) applications. Recently, there has been an increased focus on removing the dependence of accurate hand gesture recognition on complex sensor setup found in expensive proprietary devices such as the Microsoft HoloLens, Daqri and Meta Glasses. Most such solutions either rely on multi-modal sensor data or deep neural networks that can benefit greatly from abundance of labelled data. Datasets are an integral part of any deep learning based research. They have been the principal reason for the substantial progress in this field, both, in terms of providing enough data for the training of these models, and, for benchmarking competing algorithms. However, it is becoming increasingly difficult to generate enough labelled data for complex tasks such as hand gesture recognition. The goal of this work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Advanced Vision and Imaging
