TL;DR
This paper introduces DeepNAG, a novel gesture synthesis method that trains a generator without a discriminator, achieving faster training and higher realism than traditional GANs, and improves gesture recognition through data augmentation.
Contribution
The paper presents DeepNAG, a new differentiable loss-based approach for gesture synthesis that eliminates the need for a discriminator, enabling faster and more realistic data generation.
Findings
DeepNAG outperforms DeepGAN in accuracy and realism.
DeepNAG reduces training time by up to 17 times.
Synthesized gestures improve recognition performance across datasets.
Abstract
Synthetic data generation to improve classification performance (data augmentation) is a well-studied problem. Recently, generative adversarial networks (GAN) have shown superior image data augmentation performance, but their suitability in gesture synthesis has received inadequate attention. Further, GANs prohibitively require simultaneous generator and discriminator network training. We tackle both issues in this work. We first discuss a novel, device-agnostic GAN model for gesture synthesis called DeepGAN. Thereafter, we formulate DeepNAG by introducing a new differentiable loss function based on dynamic time warping and the average Hausdorff distance, which allows us to train DeepGAN's generator without requiring a discriminator. Through evaluations, we compare the utility of DeepGAN and DeepNAG against two alternative techniques for training five recognizers using data augmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
