LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction
Xinhan Di, Pengqian Yu

TL;DR
This paper introduces LWA-HAND, a lightweight, efficient model for real-time two-hand reconstruction from a single RGB image, addressing occlusion and interaction challenges with novel attention modules.
Contribution
The paper presents three innovative mobile attention modules that enable low-flops, accurate two-hand reconstruction, a novel approach in efficient transformer-based hand modeling.
Findings
Achieves comparable accuracy to state-of-the-art models on InterHand2.6M.
Reduces computational cost to 0.47 GFlops from 10-20 GFlops.
Effectively handles occlusion and interaction in hand reconstruction.
Abstract
Recent years have witnessed great success for hand reconstruction in real-time applications such as visual reality and augmented reality while interacting with two-hand reconstruction through efficient transformers is left unexplored. In this paper, we propose a method called lightweight attention hand (LWA-HAND) to reconstruct hands in low flops from a single RGB image. To solve the occlusion and interaction problem in efficient attention architectures, we propose three mobile attention modules in this paper. The first module is a lightweight feature attention module that extracts both local occlusion representation and global image patch representation in a coarse-to-fine manner. The second module is a cross image and graph bridge module which fuses image context and hand vertex. The third module is a lightweight cross-attention mechanism that uses element-wise operation for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Hand Gesture Recognition Systems · Medical Imaging and Analysis
