FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images
Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max, Argus, Thomas Brox

TL;DR
This paper introduces a large-scale, multi-view hand dataset with 3D pose and shape annotations, addressing generalization issues in 3D hand pose estimation from RGB images and enabling improved training and benchmarking.
Contribution
The paper presents the first large-scale, multi-view hand dataset with 3D pose and shape annotations, and a semi-automated annotation method, improving cross-dataset generalization and hand shape prediction.
Findings
Models trained on the new dataset generalize better across datasets.
The dataset enables training of networks predicting full hand shape from a single RGB.
The dataset serves as a benchmark for articulated hand shape estimation.
Abstract
Estimating 3D hand pose from single RGB images is a highly ambiguous problem that relies on an unbiased training dataset. In this paper, we analyze cross-dataset generalization when training on existing datasets. We find that approaches perform well on the datasets they are trained on, but do not generalize to other datasets or in-the-wild scenarios. As a consequence, we introduce the first large-scale, multi-view hand dataset that is accompanied by both 3D hand pose and shape annotations. For annotating this real-world dataset, we propose an iterative, semi-automated `human-in-the-loop' approach, which includes hand fitting optimization to infer both the 3D pose and shape for each sample. We show that methods trained on our dataset consistently perform well when tested on other datasets. Moreover, the dataset allows us to train a network that predicts the full articulated hand shape…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
