GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction
Weiquan Lin, Yaoqing Hu, Liangchen Dai, Xu Tang, Xingyu Chen

TL;DR
GeoHand leverages prior geometric knowledge and a novel fusion strategy to significantly improve monocular 3D hand reconstruction, especially in occluded and complex scenes.
Contribution
The paper introduces GeoHand, a framework that extracts high-quality geometric priors from a pretrained estimator and adapts them for detailed hand reconstruction using a novel fusion and refinement strategy.
Findings
Achieves state-of-the-art results on FreiHAND, DexYCB, and HO3Dv3 datasets.
Excels in scenarios with severe occlusions and hand-object interactions.
Effectively integrates geometric priors with RGB features for precise reconstruction.
Abstract
Monocular 3D hand reconstruction is intrinsically a geometric problem, yet RGB appearance features alone often struggle to resolve severe ambiguities caused by self-occlusions and hand-object interactions. While introducing depth can explicitly provide spatial cues, raw sensor-captured depth maps are extensively noisy and incomplete, limiting their usefulness for fine-grained hand reconstruction. To bridge this gap, we propose GeoHand, a novel framework that unlocks high-quality geometric priors from a frozen foundational monocular geometry estimator (MoGe2). Recognizing that these priors are oriented toward general scenes, we introduce a map-level GeoAdapter to recalibrate the spatial features, specifically adapting them for detailed hand reconstruction. Furthermore, to systematically integrate these adapted priors without overwhelming intrinsic RGB appearance cues, we employ a gated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
