Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation
Prakhar Kaushik, Aayush Mishra, Adam Kortylewski, Alan Yuille

TL;DR
This paper introduces 3DUDA, a novel source-free, image-only unsupervised domain adaptation method for category-level object pose estimation that leverages invariant object subparts and neural feature modeling to adapt without source data or 3D annotations.
Contribution
The paper proposes a new approach that uses invariant object subparts and neural feature representations for effective domain adaptation without source data or 3D annotations.
Findings
Significant improvement in pose estimation accuracy across diverse domain shifts.
Effective adaptation in complex scenarios with nuisances, noise, and occlusion.
Convergence of the EM-based training process to the target domain.
Abstract
We consider the problem of source-free unsupervised category-level pose estimation from only RGB images to a target domain without any access to source domain data or 3D annotations during adaptation. Collecting and annotating real-world 3D data and corresponding images is laborious, expensive, yet unavoidable process, since even 3D pose domain adaptation methods require 3D data in the target domain. We introduce 3DUDA, a method capable of adapting to a nuisance-ridden target domain without 3D or depth data. Our key insight stems from the observation that specific object subparts remain stable across out-of-domain (OOD) scenarios, enabling strategic utilization of these invariant subcomponents for effective model updates. We represent object categories as simple cuboid meshes, and harness a generative model of neural feature activations modeled at each mesh vertex learnt using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Robot Manipulation and Learning
MethodsFocus
