Adapted Human Pose: Monocular 3D Human Pose Estimation with Zero Real 3D Pose Data
Shuangjun Liu, Naveen Sehgal, Sarah Ostadabbas

TL;DR
This paper introduces AHuP, a domain adaptation approach for monocular 3D human pose estimation that effectively reduces domain shift effects without requiring real 3D pose data, achieving competitive results.
Contribution
The paper presents a novel domain adaptation method combining semantic-aware feature adaptation and skeletal pose adaptation, enabling accurate 3D pose estimation without real 3D data.
Findings
Achieves comparable performance to state-of-the-art models trained on real 3D data.
Proposes a lightweight skeletal pose adaptation module that improves existing models.
Introduces a new synthetic dataset, ScanAva+, for 3D human pose research.
Abstract
The ultimate goal for an inference model is to be robust and functional in real life applications. However, training vs. test data domain gaps often negatively affect model performance. This issue is especially critical for the monocular 3D human pose estimation problem, in which 3D human data is often collected in a controlled lab setting. In this paper, we focus on alleviating the negative effect of domain shift in both appearance and pose space for 3D human pose estimation by presenting our adapted human pose (AHuP) approach. AHuP is built upon two key components: (1) semantically aware adaptation (SAA) for the cross-domain feature space adaptation, and (2) skeletal pose adaptation (SPA) for the pose space adaptation which takes only limited information from the target domain. By using zero real 3D human pose data, one of our adapted synthetic models shows comparable performance with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Video Surveillance and Tracking Methods
MethodsAttentive Walk-Aggregating Graph Neural Network
