THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose
Eunho Lee, Chaehyeon Song, Seunghoon Jeong, Ayoung Kim

TL;DR
THE-Pose introduces a topological prior and hybrid graph fusion to improve category-level 6D object pose estimation, enhancing robustness against intra-class variations, occlusions, and complex geometries.
Contribution
It proposes a novel framework combining topological surface embedding with hybrid graph fusion to better integrate 2D and 3D features for pose estimation.
Findings
Achieves 35.8% improvement over 3D-GC baseline
Surpasses previous state-of-the-art by 7.2% on REAL275
Demonstrates robustness on complex and occluded objects
Abstract
Category-level object pose estimation requires both global context and local structure to ensure robustness against intra-class variations. However, 3D graph convolution (3D-GC) methods only focus on local geometry and depth information, making them vulnerable to complex objects and visual ambiguities. To address this, we present THE-Pose, a novel category-level 6D pose estimation framework that leverages a topological prior via surface embedding and hybrid graph fusion. Specifically, we extract consistent and invariant topological features from the image domain, effectively overcoming the limitations inherent in existing 3D-GC based methods. Our Hybrid Graph Fusion (HGF) module adaptively integrates the topological features with point-cloud features, seamlessly bridging 2D image context and 3D geometric structure. These fused features ensure stability for unseen or complicated objects,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications
