RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects

Jaeguk Kim; Jaewoo Park; Keuntek Lee; Nam Ik Cho

arXiv:2505.10841·cs.CV·May 19, 2025

RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects

Jaeguk Kim, Jaewoo Park, Keuntek Lee, Nam Ik Cho

PDF

Open Access

TL;DR

RefPose introduces a novel method for 6D pose estimation of unseen objects by leveraging reference images and geometric correspondences, using a render-and-compare refinement process with correlation-guided attention, achieving state-of-the-art results.

Contribution

The paper presents a new approach that dynamically adapts to unseen objects using reference images and geometric correspondence, improving pose estimation accuracy.

Findings

01

Achieves state-of-the-art performance on BOP datasets.

02

Robustly estimates poses for unseen objects.

03

Maintains competitive runtime performance.

Abstract

Estimating the 6D pose of unseen objects from monocular RGB images remains a challenging problem, especially due to the lack of prior object-specific knowledge. To tackle this issue, we propose RefPose, an innovative approach to object pose estimation that leverages a reference image and geometric correspondence as guidance. RefPose first predicts an initial pose by using object templates to render the reference image and establish the geometric correspondence needed for the refinement stage. During the refinement stage, RefPose estimates the geometric correspondence of the query based on the generated references and iteratively refines the pose through a render-and-compare approach. To enhance this estimation, we introduce a correlation volume-guided attention mechanism that effectively captures correlations between the query and reference images. Unlike traditional methods that depend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · 3D Shape Modeling and Analysis · Human Pose and Action Recognition

MethodsSoftmax · Attention Is All You Need