THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose

Eunho Lee; Chaehyeon Song; Seunghoon Jeong; Ayoung Kim

arXiv:2512.10251·cs.CV·December 12, 2025

THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose

Eunho Lee, Chaehyeon Song, Seunghoon Jeong, Ayoung Kim

PDF

Open Access

TL;DR

THE-Pose introduces a topological prior and hybrid graph fusion to improve category-level 6D object pose estimation, enhancing robustness against intra-class variations, occlusions, and complex geometries.

Contribution

It proposes a novel framework combining topological surface embedding with hybrid graph fusion to better integrate 2D and 3D features for pose estimation.

Findings

01

Achieves 35.8% improvement over 3D-GC baseline

02

Surpasses previous state-of-the-art by 7.2% on REAL275

03

Demonstrates robustness on complex and occluded objects

Abstract

Category-level object pose estimation requires both global context and local structure to ensure robustness against intra-class variations. However, 3D graph convolution (3D-GC) methods only focus on local geometry and depth information, making them vulnerable to complex objects and visual ambiguities. To address this, we present THE-Pose, a novel category-level 6D pose estimation framework that leverages a topological prior via surface embedding and hybrid graph fusion. Specifically, we extract consistent and invariant topological features from the image domain, effectively overcoming the limitations inherent in existing 3D-GC based methods. Our Hybrid Graph Fusion (HGF) module adaptively integrates the topological features with point-cloud features, seamlessly bridging 2D image context and 3D geometric structure. These fused features ensure stability for unseen or complicated objects,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications