MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

Yann Labb\'e; Lucas Manuelli; Arsalan Mousavian; Stephen Tyree; Stan; Birchfield; Jonathan Tremblay; Justin Carpentier; Mathieu Aubry; Dieter Fox,; Josef Sivic

arXiv:2212.06870·cs.CV·December 15, 2022·69 cites

MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

Yann Labb\'e, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan, Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox,, Josef Sivic

PDF

Open Access 1 Repo

TL;DR

MegaPose is a novel 6D pose estimation method for unseen objects that leverages synthetic data, render-and-compare refinement, and a coarse-to-fine approach, achieving state-of-the-art results without retraining on new objects.

Contribution

Introduces a render&compare based 6D pose refiner, a coarse pose classifier, and a large synthetic dataset for generalizing pose estimation to novel objects.

Findings

01

Achieves state-of-the-art on ModelNet and YCB-Video datasets.

02

Performs competitively on BOP challenge datasets.

03

Generalizes well to unseen objects without retraining.

Abstract

We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii) a CAD model of the observed object. The contributions of this work are threefold. First, we present a 6D pose refiner based on a render&compare strategy which can be applied to novel objects. The shape and coordinate system of the novel object are provided as inputs to the network by rendering multiple synthetic views of the object's CAD model. Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner. Third, we introduce a large-scale synthetic dataset of photorealistic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

megapose6d/megapose6d
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis