SSD-6D: Making RGB-based 3D detection and 6D pose estimation great again

Wadim Kehl; Fabian Manhardt; Federico Tombari; Slobodan Ilic; Nassir; Navab

arXiv:1711.10006·cs.CV·November 29, 2017

SSD-6D: Making RGB-based 3D detection and 6D pose estimation great again

Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir, Navab

PDF

1 Repo 1 Video

TL;DR

This paper introduces SSD-6D, a fast and effective RGB-based method for 3D object detection and 6D pose estimation, trained solely on synthetic data, outperforming many RGB-D methods.

Contribution

It extends SSD to cover 6D pose estimation from RGB images and trains exclusively on synthetic data, achieving competitive accuracy and high speed.

Findings

01

Outperforms state-of-the-art RGB-D methods on multiple datasets

02

Operates at around 10Hz, enabling real-time applications

03

Achieves high accuracy using only synthetic training data

Abstract

We present a novel method for detecting 3D model instances and estimating their 6D poses from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover the full 6D pose space and train on synthetic model data only. Our approach competes or surpasses current state-of-the-art methods that leverage RGB-D data on multiple challenging datasets. Furthermore, our method produces these results at around 10Hz, which is many times faster than the related methods. For the sake of reproducibility, we make our trained networks and detection code publicly available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wadimkehl/ssd-6d
tf

Videos

SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again· youtube

Taxonomy

MethodsConvolution · Non Maximum Suppression · 1x1 Convolution · SSD