GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction

Weiquan Lin; Yaoqing Hu; Liangchen Dai; Xu Tang; Xingyu Chen

arXiv:2605.17354·cs.CV·May 19, 2026

GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction

Weiquan Lin, Yaoqing Hu, Liangchen Dai, Xu Tang, Xingyu Chen

PDF

TL;DR

GeoHand leverages prior geometric knowledge and a novel fusion strategy to significantly improve monocular 3D hand reconstruction, especially in occluded and complex scenes.

Contribution

The paper introduces GeoHand, a framework that extracts high-quality geometric priors from a pretrained estimator and adapts them for detailed hand reconstruction using a novel fusion and refinement strategy.

Findings

01

Achieves state-of-the-art results on FreiHAND, DexYCB, and HO3Dv3 datasets.

02

Excels in scenarios with severe occlusions and hand-object interactions.

03

Effectively integrates geometric priors with RGB features for precise reconstruction.

Abstract

Monocular 3D hand reconstruction is intrinsically a geometric problem, yet RGB appearance features alone often struggle to resolve severe ambiguities caused by self-occlusions and hand-object interactions. While introducing depth can explicitly provide spatial cues, raw sensor-captured depth maps are extensively noisy and incomplete, limiting their usefulness for fine-grained hand reconstruction. To bridge this gap, we propose GeoHand, a novel framework that unlocks high-quality geometric priors from a frozen foundational monocular geometry estimator (MoGe2). Recognizing that these priors are oriented toward general scenes, we introduce a map-level GeoAdapter to recalibrate the spatial features, specifically adapting them for detailed hand reconstruction. Furthermore, to systematically integrate these adapted priors without overwhelming intrinsic RGB appearance cues, we employ a gated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.