Back to RGB: 3D tracking of hands and hand-object interactions based on short-baseline stereo
Paschalis Panteleris (1), Antonis Argyros (1, 2) ((1) Institute, of Computer Science, FORTH, (2) Computer Science Department, University of, Crete)

TL;DR
This paper introduces a novel stereo vision-based method for real-time 3D hand tracking that bypasses depth reconstruction, achieving comparable or superior accuracy to RGBD-based approaches in various interaction scenarios.
Contribution
It presents a new optimization-based approach that uses color consistency for 3D hand tracking from stereo images, eliminating the need for dense 3D reconstruction.
Findings
Stereo-based method matches RGBD accuracy in standard datasets
The approach performs well in real-time hand and object interaction scenarios
It can outperform RGBD methods in certain cases
Abstract
We present a novel solution to the problem of 3D tracking of the articulated motion of human hand(s), possibly in interaction with other objects. The vast majority of contemporary relevant work capitalizes on depth information provided by RGBD cameras. In this work, we show that accurate and efficient 3D hand tracking is possible, even for the case of RGB stereo. A straightforward approach for solving the problem based on such input would be to first recover depth and then apply a state of the art depth-based 3D hand tracking method. Unfortunately, this does not work well in practice because the stereo-based, dense 3D reconstruction of hands is far less accurate than the one obtained by RGBD cameras. Our approach bypasses 3D reconstruction and follows a completely different route: 3D hand tracking is formulated as an optimization problem whose solution is the hand configuration that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Advanced Vision and Imaging
