Category-level 6D Object Pose Recovery in Depth Images
Caner Sahin, Tae-Kyun Kim

TL;DR
This paper introduces a novel part-based architecture called Intrinsic Structure Adaptor (ISA) for category-level 6D object pose estimation in depth images, effectively handling intra-class variations and distribution shifts.
Contribution
The study proposes ISA, a new architecture that uses shape-invariant features and graph matching to improve generalization in category-level 6D pose estimation from depth images.
Findings
Promising performance on synthetic datasets
Effective handling of shape discrepancies and distribution shifts
Robust generalization to unseen instances
Abstract
Intra-class variations, distribution shifts among source and target domains are the major challenges of category-level tasks. In this study, we address category-level full 6D object pose estimation in the context of depth modality, introducing a novel part-based architecture that can tackle the above-mentioned challenges. Our architecture particularly adapts the distribution shifts arising from shape discrepancies, and naturally removes the variations of texture, illumination, pose, etc., so we call it as "Intrinsic Structure Adaptor (ISA)". We engineer ISA based on the followings: i) "Semantically Selected Centers (SSC)" are proposed in order to define the "6D pose" at the level of categories. ii) 3D skeleton structures, which we derive as shape-invariant features, are used to represent the parts extracted from the instances of given categories, and privileged one-class learning is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage
