Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB   Images

Yuan Liu; Yilin Wen; Sida Peng; Cheng Lin; Xiaoxiao Long; and Taku Komura; Wenping Wang

arXiv:2204.10776·cs.CV·January 30, 2023·5 cites

Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images

Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, and Taku Komura, Wenping Wang

PDF

Open Access 1 Repo

TL;DR

Gen6D is a novel model-free 6-DoF object pose estimation method that generalizes to unseen objects using only posed images, without requiring 3D models or additional sensor data, achieving state-of-the-art results.

Contribution

It introduces a fully generalizable, model-free pose estimator that operates without 3D models or depth data, expanding application scope to unseen objects.

Findings

01

Achieves state-of-the-art results on MOPED and GenMOP datasets.

02

Performs competitively on LINEMOD dataset.

03

Does not require 3D models or depth maps during testing.

Abstract

In this paper, we present a generalizable model-free 6-DoF object pose estimator called Gen6D. Existing generalizable pose estimators either need high-quality object models or require additional depth maps or object masks in test time, which significantly limits their application scope. In contrast, our pose estimator only requires some posed images of the unseen object and is able to accurately predict the poses of the object in arbitrary environments. Gen6D consists of an object detector, a viewpoint selector and a pose refiner, all of which do not require the 3D object model and can generalize to unseen objects. Experiments show that Gen6D achieves state-of-the-art results on two model-free datasets: the MOPED dataset and a new GenMOP dataset collected by us. In addition, on the LINEMOD dataset, Gen6D achieves competitive results compared with instance-specific pose estimators.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuyuan-pal/gen6d
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Robot Manipulation and Learning