Semi-supervised 3D Hand-Object Pose Estimation via Pose Dictionary Learning
Zida Cheng, Siheng Chen, Ya Zhang

TL;DR
This paper introduces a semi-supervised approach for 3D hand-object pose estimation that leverages pose dictionary learning and an object-oriented coordinate system to reduce labeling costs and improve accuracy.
Contribution
It proposes a novel semi-supervised framework combining pose dictionary learning and an object-oriented coordinate system for 3D hand-object pose estimation.
Findings
Reduces estimation error by 19.5% for hands and 24.9% for objects.
Outperforms several baseline methods on FPHA and HO-3D datasets.
Demonstrates robustness and effectiveness of the proposed approach.
Abstract
3D hand-object pose estimation is an important issue to understand the interaction between human and environment. Current hand-object pose estimation methods require detailed 3D labels, which are expensive and labor-intensive. To tackle the problem of data collection, we propose a semi-supervised 3D hand-object pose estimation method with two key techniques: pose dictionary learning and an object-oriented coordinate system. The proposed pose dictionary learning module can distinguish infeasible poses by reconstruction error, enabling unlabeled data to provide supervision signals. The proposed object-oriented coordinate system can make 3D estimations equivariant to the camera perspective. Experiments are conducted on FPHA and HO-3D datasets. Our method reduces estimation error by 19.5% / 24.9% for hands/objects compared to straightforward use of labeled data on FPHA and outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Human Pose and Action Recognition · Robot Manipulation and Learning
