Augmented Skeleton Space Transfer for Depth-based Hand Pose Estimation
Seungryul Baek, Kwang In Kim, Tae-Kyun Kim

TL;DR
This paper introduces a novel method for depth-based 3D hand pose estimation that synthesizes data in skeleton space and employs a unified GAN framework to improve robustness and accuracy across diverse datasets.
Contribution
It proposes a skeleton space data augmentation technique combined with a hand pose generator and discriminator within a GAN framework for enhanced hand pose estimation.
Findings
Outperforms state-of-the-art on four benchmark datasets.
Generates realistic hand pose data in skeleton space.
Refines pose estimates using a learned discriminator.
Abstract
Crucial to the success of training a depth-based 3D hand pose estimator (HPE) is the availability of comprehensive datasets covering diverse camera perspectives, shapes, and pose variations. However, collecting such annotated datasets is challenging. We propose to complete existing databases by generating new database entries. The key idea is to synthesize data in the skeleton space (instead of doing so in the depth-map space) which enables an easy and intuitive way of manipulating data entries. Since the skeleton entries generated in this way do not have the corresponding depth map entries, we exploit them by training a separate hand pose generator (HPG) which synthesizes the depth map from the skeleton entries. By training the HPG and HPE in a single unified optimization framework enforcing that 1) the HPE agrees with the paired depth and skeleton entries; and 2) the HPG-HPE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Robot Manipulation and Learning
