A hybrid classification-regression approach for 3D hand pose estimation   using graph convolutional networks

Ikram Kourbane; Yakup Genc

arXiv:2105.10902·cs.CV·May 25, 2021

A hybrid classification-regression approach for 3D hand pose estimation using graph convolutional networks

Ikram Kourbane, Yakup Genc

PDF

TL;DR

This paper introduces a two-stage GCN-based framework for 3D hand pose estimation from RGB images, which learns pose-dependent joint relationships and outperforms existing methods on public datasets.

Contribution

It proposes a novel multi-stage GCN approach that adaptively learns joint relationships, improving accuracy in 3D hand pose estimation from monocular images.

Findings

01

Outperforms state-of-the-art on public datasets

02

Efficient multi-stage GCN model for hand pose estimation

03

Effective learning of pose-dependent joint relationships

Abstract

Hand pose estimation is a crucial part of a wide range of augmented reality and human-computer interaction applications. Predicting the 3D hand pose from a single RGB image is challenging due to occlusion and depth ambiguities. GCN-based (Graph Convolutional Networks) methods exploit the structural relationship similarity between graphs and hand joints to model kinematic dependencies between joints. These techniques use predefined or globally learned joint relationships, which may fail to capture pose-dependent constraints. To address this problem, we propose a two-stage GCN-based framework that learns per-pose relationship constraints. Specifically, the first phase quantizes the 2D/3D space to classify the joints into 2D/3D blocks based on their locality. This spatial dependency information guides this phase to estimate reliable 2D and 3D poses. The second stage further improves the 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Convolutional Network