Semantics-aware Test-time Adaptation for 3D Human Pose Estimation
Qiuxia Lin, Rongyu Chen, Kerui Gu, Angela Yao

TL;DR
This paper introduces a semantics-aware test-time adaptation method for 3D human pose estimation that leverages a motion prior and pose completion to improve accuracy, especially under occlusions and truncations.
Contribution
It pioneers the integration of a semantics-aware motion prior and pose completion for test-time adaptation in 3D pose estimation, addressing misalignment issues.
Findings
Over 12% reduction in PA-MPJPE on 3DPW and 3DHP datasets.
Significant improvement over state-of-the-art test-time adaptation techniques.
Enhanced robustness to occlusions and truncations.
Abstract
This work highlights a semantics misalignment in 3D human pose estimation. For the task of test-time adaptation, the misalignment manifests as overly smoothed and unguided predictions. The smoothing settles predictions towards some average pose. Furthermore, when there are occlusions or truncations, the adaptation becomes fully unguided. To this end, we pioneer the integration of a semantics-aware motion prior for the test-time adaptation of 3D pose estimation. We leverage video understanding and a well-structured motion-text space to adapt the model motion prediction to adhere to video semantics during test time. Additionally, we incorporate a missing 2D pose completion based on the motion-text similarity. The pose completion strengthens the motion prior's guidance for occlusions and truncations. Our method significantly improves state-of-the-art 3D human pose estimation TTA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Gait Recognition and Analysis
