HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton

Wencan Cheng; Jae Hyun Park; Jong Hwan Ko

arXiv:2108.05545·cs.CV·August 13, 2021

TL;DR

HandFoldingNet is a folding-based 3D hand pose estimation model that regresses joint locations from a normalized 3D hand point cloud. Folding is guided by multi-scale features, and the model outperforms existing methods on three benchmark datasets while using fewer parameters.

Contribution

The paper presents a new folding-based decoder guided by multi-scale features for accurate and efficient 3D hand pose estimation from point clouds.

Findings

01 Outperforms existing methods on three benchmark datasets.

02 Requires fewer model parameters than comparable approaches.

03 Achieves higher accuracy in hand joint localization.

Abstract

With the growing use of 3D hand pose estimation in human-computer interaction applications, convolutional neural network (CNN)-based estimation models have been actively explored. However, existing models require complex architectures or redundant computational resources to reach acceptable accuracy. To tackle this limitation, this paper proposes HandFoldingNet, an accurate and efficient hand pose estimator that regresses the hand joint locations from a normalized 3D hand point cloud input. The proposed model uses a folding-based decoder that folds a given 2D hand skeleton into the corresponding joint coordinates. For higher estimation accuracy, folding is guided by multi-scale features, which include both global and joint-wise local features. Experimental results show that the proposed model outperforms the existing methods on three hand pose…
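The folding idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that "folding" means concatenating each joint's 2D skeleton coordinate with a global feature vector and passing the result through an MLP shared across joints. The joint count, feature size, hidden width, random weights, and the function names `shared_mlp` and `fold_skeleton` are all hypothetical, and the joint-wise local feature guidance mentioned in the abstract is omitted.

```python
import numpy as np


def shared_mlp(x, weights, biases):
    """Apply the same MLP to every row of x, with ReLU between layers."""
    for i, (w, b) in enumerate(zip(weights, biases)):
        x = x @ w + b
        if i < len(weights) - 1:  # no activation after the output layer
            x = np.maximum(x, 0.0)
    return x


def fold_skeleton(skeleton_2d, global_feat, weights, biases):
    """Fold a (J, 2) skeleton into (J, 3) joint coordinates.

    The global feature is tiled across joints and concatenated with each
    joint's 2D coordinate, so every joint is decoded from the same code.
    """
    num_joints = skeleton_2d.shape[0]
    tiled = np.tile(global_feat, (num_joints, 1))               # (J, C)
    fold_in = np.concatenate([skeleton_2d, tiled], axis=1)      # (J, 2 + C)
    return shared_mlp(fold_in, weights, biases)                 # (J, 3)


# Hypothetical sizes and untrained random weights, for shape checking only.
rng = np.random.default_rng(0)
NUM_JOINTS, FEAT_DIM, HIDDEN = 21, 64, 32

skeleton_2d = rng.uniform(-1.0, 1.0, size=(NUM_JOINTS, 2))  # fixed 2D layout
global_feat = rng.normal(size=FEAT_DIM)  # stand-in for an encoder output

weights = [
    rng.normal(scale=0.1, size=(2 + FEAT_DIM, HIDDEN)),
    rng.normal(scale=0.1, size=(HIDDEN, 3)),
]
biases = [np.zeros(HIDDEN), np.zeros(3)]

joints_3d = fold_skeleton(skeleton_2d, global_feat, weights, biases)
```

In a trained model the weights would be learned and the global feature would come from a point cloud encoder; here both are random, so only the tensor shapes are meaningful.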

Code & Models

Repositories

cwc1260/handfold
PyTorch · Official

Taxonomy

Methods: Convolution