Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View   CNN to Multi-View CNNs

Liuhao Ge; Hui Liang; Junsong Yuan; Daniel Thalmann

arXiv:1606.07253·cs.CV·December 28, 2016·60 cites

Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs

Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann

PDF

Open Access

TL;DR

This paper introduces a multi-view CNN approach for 3D hand pose estimation from single depth images, projecting images onto three orthogonal planes and fusing heat-maps for improved accuracy and generalization.

Contribution

It proposes a novel multi-view projection and fusion method for hand pose estimation, outperforming existing single-view approaches.

Findings

01

Significantly outperforms state-of-the-art methods on a challenging dataset.

02

Demonstrates strong generalization across different datasets.

03

Utilizes multi-view projections to improve 3D pose accuracy.

Abstract

Articulated hand pose estimation plays an important role in human-computer interaction. Despite the recent progress, the accuracy of existing methods is still not satisfactory, partially due to the difficulty of embedded high-dimensional and non-linear regression problem. Different from the existing discriminative methods that regress for the hand pose with a single depth image, we propose to first project the query depth image onto three orthogonal planes and utilize these multi-view projections to regress for 2D heat-maps which estimate the joint positions on each plane. These multi-view heat-maps are then fused to produce final 3D hand pose estimation with learned pose priors. Experiments show that the proposed method largely outperforms state-of-the-art on a challenging dataset. Moreover, a cross-dataset experiment also demonstrates the good generalization ability of the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Robot Manipulation and Learning