Coupled Iterative Refinement for 6D Multi-Object Pose Estimation

Lahav Lipson; Zachary Teed; Ankit Goyal; Jia Deng

arXiv:2204.12516·cs.CV·April 28, 2022·1 cites

Coupled Iterative Refinement for 6D Multi-Object Pose Estimation

Lahav Lipson, Zachary Teed, Ankit Goyal, Jia Deng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel end-to-end differentiable framework for 6D multi-object pose estimation that iteratively refines pose and correspondence, achieving state-of-the-art accuracy on standard benchmarks.

Contribution

It presents a new coupled iterative refinement architecture with a differentiable BD-PnP layer for improved 6D object pose estimation.

Findings

01

Achieves state-of-the-art accuracy on 6D pose benchmarks.

02

Effectively removes outliers through iterative refinement.

03

Demonstrates the effectiveness of geometric knowledge integration.

Abstract

We address the task of 6D multi-object pose: given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object. We propose a new approach to 6D object pose estimation which consists of an end-to-end differentiable architecture that makes use of geometric knowledge. Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy. We use a novel differentiable layer to perform pose refinement by solving an optimization problem we refer to as Bidirectional Depth-Augmented Perspective-N-Point (BD-PnP). Our method achieves state-of-the-art accuracy on standard 6D Object Pose benchmarks. Code is available at https://github.com/princeton-vl/Coupled-Iterative-Refinement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

princeton-vl/coupled-iterative-refinement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotics and Sensor-Based Localization