MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare
Yann Labb\'e, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan, Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox,, Josef Sivic

TL;DR
MegaPose is a novel 6D pose estimation method for unseen objects that leverages synthetic data, render-and-compare refinement, and a coarse-to-fine approach, achieving state-of-the-art results without retraining on new objects.
Contribution
Introduces a render&compare based 6D pose refiner, a coarse pose classifier, and a large synthetic dataset for generalizing pose estimation to novel objects.
Findings
Achieves state-of-the-art on ModelNet and YCB-Video datasets.
Performs competitively on BOP challenge datasets.
Generalizes well to unseen objects without retraining.
Abstract
We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii) a CAD model of the observed object. The contributions of this work are threefold. First, we present a 6D pose refiner based on a render&compare strategy which can be applied to novel objects. The shape and coordinate system of the novel object are provided as inputs to the network by rendering multiple synthetic views of the object's CAD model. Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner. Third, we introduce a large-scale synthetic dataset of photorealistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis
