End-to-end Weakly-supervised Single-stage Multiple 3D Hand Mesh Reconstruction from a Single RGB Image
Jinwei Ren, Jianke Zhu, and Jialiang Zhang

TL;DR
This paper introduces a novel single-stage, weakly-supervised approach for simultaneous 3D reconstruction of multiple hands from a single RGB image, improving efficiency and accuracy over existing multi-stage methods.
Contribution
It proposes the first single-stage multi-hand reconstruction pipeline with a multi-head auto-encoder and weak supervision, reducing computational redundancy and annotation costs.
Findings
Outperforms state-of-the-art methods on multiple benchmarks.
Effective weakly-supervised training with 2D annotations.
Achieves high accuracy in multi-hand 3D reconstruction.
Abstract
In this paper, we consider the challenging task of simultaneously locating and recovering multiple hands from a single 2D image. Previous studies either focus on single hand reconstruction or solve this problem in a multi-stage way. Moreover, the conventional two-stage pipeline firstly detects hand areas, and then estimates 3D hand pose from each cropped patch. To reduce the computational redundancy in preprocessing and feature extraction, for the first time, we propose a concise but efficient single-stage pipeline for multi-hand reconstruction. Specifically, we design a multi-head auto-encoder structure, where each head network shares the same feature map and outputs the hand center, pose and texture, respectively. Besides, we adopt a weakly-supervised scheme to alleviate the burden of expensive 3D real-world data annotations. To this end, we propose a series of losses optimized by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForensic Anthropology and Bioarchaeology Studies · Hand Gesture Recognition Systems · Orthopedic Surgery and Rehabilitation
