DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning   from Synthetic Depth

Jameel Malik; Ahmed Elhayek; Fabrizio Nunnari; Kiran Varanasi; Kiarash; Tamaddon; Alexis Heloir; Didier Stricker

arXiv:1808.09208·cs.CV·August 29, 2018·6 cites

DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning from Synthetic Depth

Jameel Malik, Ahmed Elhayek, Fabrizio Nunnari, Kiran Varanasi, Kiarash, Tamaddon, Alexis Heloir, Didier Stricker

PDF

Open Access

TL;DR

DeepHPS introduces a fully supervised deep learning approach for estimating 3D hand pose and shape from a single depth image, utilizing a synthetic dataset and a novel hand model layer for accurate results.

Contribution

The paper presents a new end-to-end deep network with a hand model layer and a large synthetic dataset for improved 3D hand pose and shape estimation.

Findings

01

Achieves state-of-the-art results on NYU and ICVL benchmarks.

02

Demonstrates effective joint training on synthetic and real data.

03

Reconstructs 3D hand mesh and pose in 3.7ms.

Abstract

Articulated hand pose and shape estimation is an important problem for vision-based applications such as augmented reality and animation. In contrast to the existing methods which optimize only for joint positions, we propose a fully supervised deep network which learns to jointly estimate a full 3D hand mesh representation and pose from a single depth image. To this end, a CNN architecture is employed to estimate parametric representations i.e. hand pose, bone scales and complex shape parameters. Then, a novel hand pose and shape layer, embedded inside our deep framework, produces 3D joint positions and hand mesh. Lack of sufficient training data with varying hand shapes limits the generalized performance of learning based methods. Also, manually annotating real data is suboptimal. Therefore, we present SynHand5M: a million-scale synthetic dataset with accurate joint annotations,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Advanced Vision and Imaging