A Simple Baseline for Efficient Hand Mesh Reconstruction
Zhishan Zhou, Shihao.zhou, Zhi Lv, Minqiang Zou, Yao Tang, Jiajun, Liang

TL;DR
This paper introduces a simple, efficient baseline for 3D hand mesh reconstruction that outperforms state-of-the-art methods in accuracy and speed, while maintaining modularity and ease of integration.
Contribution
The authors propose a decoupled, core-structure-based approach with a token generator and mesh regressor that achieves SOTA results across multiple datasets.
Findings
Achieves 5.7mm PA-MPJPE on FreiHAND
Reaches 70 fps with FastViT-MA36
Outperforms existing methods in accuracy and efficiency
Abstract
3D hand pose estimation has found broad application in areas such as gesture recognition and human-machine interaction tasks. As performance improves, the complexity of the systems also increases, which can limit the comparative analysis and practical implementation of these methods. In this paper, we propose a simple yet effective baseline that not only surpasses state-of-the-art (SOTA) methods but also demonstrates computational efficiency. To establish this baseline, we abstract existing work into two components: a token generator and a mesh regressor, and then examine their core structures. A core structure, in this context, is one that fulfills intrinsic functions, brings about significant improvements, and achieves excellent performance without unnecessary complexities. Our proposed approach is decoupled from any modifications to the backbone, making it adaptable to any modern…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReconstructive Surgery and Microvascular Techniques · Orthopedic Surgery and Rehabilitation
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Convolution · Batch Normalization · HRNet
