Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba

Haoye Dong; Aviral Chharia; Wenbo Gou; Francisco Vicente Carrasco; Fernando De la Torre

arXiv:2407.09646·cs.CV·November 27, 2025·2 cites

Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba

Haoye Dong, Aviral Chharia, Wenbo Gou, Francisco Vicente Carrasco, Fernando De la Torre

PDF

Open Access 1 Repo 1 Video

TL;DR

Hamba introduces a graph-guided bidirectional scanning framework for single-view 3D hand reconstruction, significantly improving accuracy and efficiency over existing attention-based methods by effectively modeling spatial joint relations.

Contribution

The paper proposes the GSS block and a fusion module that leverage graph-guided state space modeling, reducing token usage and enhancing 3D hand reconstruction performance.

Findings

01

Achieves state-of-the-art accuracy with PA-MPVPE of 5.3mm on FreiHAND

02

Uses 88.5% fewer tokens than attention-based methods

03

Ranks 1st in two 3D hand reconstruction leaderboards

Abstract

3D Hand reconstruction from a single RGB image is challenging due to the articulated motion, self-occlusion, and interaction with objects. Existing SOTA methods employ attention-based transformers to learn the 3D hand pose and shape, yet they do not fully achieve robust and accurate performance, primarily due to inefficiently modeling spatial relations between joints. To address this problem, we propose a novel graph-guided Mamba framework, named Hamba, which bridges graph learning and state space modeling. Our core idea is to reformulate Mamba's scanning into graph-guided bidirectional scanning for 3D reconstruction using a few effective tokens. This enables us to efficiently learn the spatial relationships between joints for improving reconstruction performance. Specifically, we design a Graph-guided State Space (GSS) block that learns the graph-structured relations and spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

humansensinglab/Hamba
pytorchOfficial

Videos

Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba· slideslive

Taxonomy

TopicsAnatomy and Medical Technology