PeCLR: Self-Supervised 3D Hand Pose Estimation from monocular RGB via Equivariant Contrastive Learning
Adrian Spurr, Aneesh Dahiya, Xi Wang, Xucong Zhang, Otmar Hilliges

TL;DR
This paper introduces PeCLR, a self-supervised method using equivariant contrastive learning to improve 3D hand pose estimation from monocular RGB images, achieving state-of-the-art results without specialized architectures.
Contribution
It proposes an equivariant contrastive learning framework tailored for 3D hand pose estimation, leveraging unlabeled data to learn invariant and equivariant features.
Findings
Up to 14.5% improvement in PA-EPE on FreiHAND dataset
Equivariant contrastive objectives outperform invariant ones
Standard deep ResNets can achieve state-of-the-art results with unlabeled data
Abstract
Encouraged by the success of contrastive learning on image classification tasks, we propose a new self-supervised method for the structured regression task of 3D hand pose estimation. Contrastive learning makes use of unlabeled data for the purpose of representation learning via a loss formulation that encourages the learned feature representations to be invariant under any image transformation. For 3D hand pose estimation, it too is desirable to have invariance to appearance transformation such as color jitter. However, the task requires equivariance under affine transformations, such as rotation and translation. To address this issue, we propose an equivariant contrastive objective and demonstrate its effectiveness in the context of 3D hand pose estimation. We experimentally investigate the impact of invariant and equivariant contrastive objectives and show that learning equivariant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Robot Manipulation and Learning
MethodsContrastive Learning
