One2Any: One-Reference 6D Pose Estimation for Any Object

Mengya Liu; Siyuan Li; Ajad Chhatkuli; Prune Truong; Luc Van Gool,; Federico Tombari

arXiv:2505.04109·cs.CV·May 8, 2025

One2Any: One-Reference 6D Pose Estimation for Any Object

Mengya Liu, Siyuan Li, Ajad Chhatkuli, Prune Truong, Luc Van Gool,, Federico Tombari

PDF

Open Access

TL;DR

One2Any introduces a novel single-view RGB-D based method for 6D object pose estimation that generalizes to new objects without needing 3D models or multi-view data, achieving state-of-the-art results.

Contribution

It proposes a new encoding-decoding framework that estimates object pose from a single reference image, enabling scalable training and generalization to unseen objects.

Findings

01

Achieves state-of-the-art accuracy on benchmark datasets.

02

Generalizes well to novel objects without prior 3D models.

03

Operates efficiently with only a single reference image.

Abstract

6D object pose estimation remains challenging for many applications due to dependencies on complete 3D models, multi-view images, or training limited to specific object categories. These requirements make generalization to novel objects difficult for which neither 3D models nor multi-view images may be available. To address this, we propose a novel method One2Any that estimates the relative 6-degrees of freedom (DOF) object pose using only a single reference-single query RGB-D image, without prior knowledge of its 3D model, multi-view data, or category constraints. We treat object pose estimation as an encoding-decoding process, first, we obtain a comprehensive Reference Object Pose Embedding (ROPE) that encodes an object shape, orientation, and texture from a single reference view. Using this embedding, a U-Net-based pose decoding module produces Reference Object Coordinate (ROC) for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotics and Sensor-Based Localization