Aligning Silhouette Topology for Self-Adaptive 3D Human Pose Recovery
Mugalodi Rakesh, Jogendra Nath Kundu, Varun Jampani, R. Venkatesh Babu

TL;DR
This paper introduces a silhouette-based domain adaptation method for 3D human pose estimation that leverages topology-aware loss functions to improve model robustness across diverse real-world scenarios without requiring extensive annotations.
Contribution
It proposes a novel silhouette topology alignment framework using spatial transformations and a Chamfer-inspired loss for effective self-adaptation in unlabeled target domains.
Findings
Outperforms prior methods in diverse in-the-wild datasets
Effective in low-resolution and adversarially perturbed images
Achieves robust 3D pose estimation without auxiliary cues
Abstract
Articulation-centric 2D/3D pose supervision forms the core training objective in most existing 3D human pose estimation techniques. Except for synthetic source environments, acquiring such rich supervision for each real target domain at deployment is highly inconvenient. However, we realize that standard foreground silhouette estimation techniques (on static camera feeds) remain unaffected by domain-shifts. Motivated by this, we propose a novel target adaptation framework that relies only on silhouette supervision to adapt a source-trained model-based regressor. However, in the absence of any auxiliary cue (multi-view, depth, or 2D pose), an isolated silhouette loss fails to provide a reliable pose-specific gradient and requires to be employed in tandem with a topology-centric loss. To this end, we develop a series of convolution-friendly spatial transformations in order to disentangle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
