HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning
Eugene Valassakis, Guillermo Garcia-Hernando

TL;DR
This paper introduces HandDGP, an end-to-end framework that predicts camera-space hand meshes from single RGB images by unifying previous separate stages and incorporating a differentiable global positioning module, improving accuracy and consistency.
Contribution
The main novelty is the end-to-end unification of hand mesh prediction and camera-space positioning using a differentiable module, along with an image rectification step to address scale-depth ambiguity.
Findings
Outperforms previous methods on three public benchmarks.
Effectively preserves contextual and scale information during prediction.
Reduces scale-depth ambiguity through image rectification.
Abstract
Predicting camera-space hand meshes from single RGB images is crucial for enabling realistic hand interactions in 3D virtual and augmented worlds. Previous work typically divided the task into two stages: given a cropped image of the hand, predict meshes in relative coordinates, followed by lifting these predictions into camera space in a separate and independent stage, often resulting in the loss of valuable contextual and scale information. To prevent the loss of these cues, we propose unifying these two stages into an end-to-end solution that addresses the 2D-3D correspondence problem. This solution enables back-propagation from camera space outputs to the rest of the network through a new differentiable global positioning module. We also introduce an image rectification step that harmonizes both the training dataset and the input image as if they were acquired with the same camera,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Ergonomics and Musculoskeletal Disorders · Human Pose and Action Recognition
