InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose   Estimation from a Single RGB Image

Gyeongsik Moon; Shoou-i Yu; He Wen; Takaaki Shiratori; Kyoung Mu Lee

arXiv:2008.09309·cs.CV·August 24, 2020

InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image

Gyeongsik Moon, Shoou-i Yu, He Wen, Takaaki Shiratori, Kyoung Mu Lee

PDF

2 Repos

TL;DR

This paper introduces a large-scale dataset and a baseline network for 3D interacting hand pose estimation from a single RGB image, addressing the gap in existing research focused mainly on single hand pose estimation.

Contribution

The paper presents InterHand2.6M, a new dataset with 2.6 million labeled frames of single and interacting hands, and InterNet, a baseline network for 3D interacting hand pose estimation.

Findings

01

Significant accuracy improvements in 3D interacting hand pose estimation using the new dataset.

02

InterNet achieves strong baseline performance on InterHand2.6M.

03

Demonstrates feasibility of 3D interacting hand pose estimation from general images.

Abstract

Analysis of hand-hand interactions is a crucial step towards better understanding human behavior. However, most researches in 3D hand pose estimation have focused on the isolated single hand case. Therefore, we firstly propose (1) a large-scale dataset, InterHand2.6M, and (2) a baseline network, InterNet, for 3D interacting hand pose estimation from a single RGB image. The proposed InterHand2.6M consists of \textbf{2.6M labeled single and interacting hand frames} under various poses from multiple subjects. Our InterNet simultaneously performs 3D single and interacting hand pose estimation. In our experiments, we demonstrate big gains in 3D interacting hand pose estimation accuracy when leveraging the interacting hand data in InterHand2.6M. We also report the accuracy of InterNet on InterHand2.6M, which serves as a strong baseline for this new dataset. Finally, we show 3D interacting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.