Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and   Objects for 3D Hand Pose Estimation under Hand-Object Interaction

Anil Armagan; Guillermo Garcia-Hernando; Seungryul Baek; Shreyas; Hampali; Mahdi Rad; Zhaohui Zhang; Shipeng Xie; MingXiu Chen; Boshen Zhang,; Fu Xiong; Yang Xiao; Zhiguo Cao; Junsong Yuan; Pengfei Ren; Weiting Huang,; Haifeng Sun; Marek Hr\'uz; Jakub Kanis; Zden\v{e}k Kr\v{n}oul; Qingfu Wan,; Shile Li; Linlin Yang; Dongheui Lee; Angela Yao; Weiguo Zhou; Sijia Mei,; Yunhui Liu; Adrian Spurr; Umar Iqbal; Pavlo Molchanov; Philippe Weinzaepfel,; Romain Br\'egier; Gr\'egory Rogez; Vincent Lepetit; Tae-Kyun Kim

arXiv:2003.13764·cs.CV·September 11, 2020·6 cites

Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas, Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang,, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang,, Haifeng Sun, Marek Hr\'uz, Jakub Kanis

PDF

Open Access

TL;DR

This paper evaluates how well current 3D hand pose estimation methods generalize to unseen poses, shapes, viewpoints, and objects, highlighting challenges and improvements through a public challenge and analysis of various techniques.

Contribution

It introduces the HANDS'19 challenge to assess generalization in 3D hand pose estimation and analyzes factors affecting performance, including data modalities and model choices.

Findings

01

Accuracy improved from 27mm to 13mm mean joint error.

02

Generalization is limited on unseen poses and shapes.

03

Ensemble methods and parametric models enhance performance.

Abstract

We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole space densely, despite recent efforts in collecting large-scale training datasets. This sampling problem is even more severe when hands are interacting with objects and/or inputs are RGB rather than depth images, as RGB images also vary with lighting conditions and colors. To address these issues, we designed a public challenge (HANDS'19) to evaluate the abilities of current 3D hand pose estimators (HPEs) to interpolate and extrapolate the poses of a training set. More exactly, HANDS'19 is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Hand Gesture Recognition Systems