A hybrid classification-regression approach for 3D hand pose estimation using graph convolutional networks
Ikram Kourbane, Yakup Genc

TL;DR
This paper introduces a two-stage GCN-based framework for 3D hand pose estimation from RGB images, which learns pose-dependent joint relationships and outperforms existing methods on public datasets.
Contribution
It proposes a novel multi-stage GCN approach that adaptively learns joint relationships, improving accuracy in 3D hand pose estimation from monocular images.
Findings
Outperforms state-of-the-art on public datasets
Efficient multi-stage GCN model for hand pose estimation
Effective learning of pose-dependent joint relationships
Abstract
Hand pose estimation is a crucial part of a wide range of augmented reality and human-computer interaction applications. Predicting the 3D hand pose from a single RGB image is challenging due to occlusion and depth ambiguities. GCN-based (Graph Convolutional Networks) methods exploit the structural relationship similarity between graphs and hand joints to model kinematic dependencies between joints. These techniques use predefined or globally learned joint relationships, which may fail to capture pose-dependent constraints. To address this problem, we propose a two-stage GCN-based framework that learns per-pose relationship constraints. Specifically, the first phase quantizes the 2D/3D space to classify the joints into 2D/3D blocks based on their locality. This spatial dependency information guides this phase to estimate reliable 2D and 3D poses. The second stage further improves the 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Convolutional Network
