Object 6D Pose Estimation with Non-local Attention

Jianhan Mei; Henghui Ding; Xudong Jiang

arXiv:2002.08749·cs.CV·February 21, 2020·1 cites

Object 6D Pose Estimation with Non-local Attention

Jianhan Mei, Henghui Ding, Xudong Jiang

PDF

Open Access

TL;DR

This paper presents a deep learning approach for 6D object pose estimation from a single RGB image, integrating pose estimation into detection and enhancing robustness with a non-local attention module, achieving state-of-the-art results.

Contribution

It introduces a novel network that combines 6D pose estimation with object detection and incorporates a non-local self-attention module for occlusion robustness.

Findings

01

Achieves state-of-the-art performance on YCB-video dataset.

02

Demonstrates robustness to occlusion with the non-local attention module.

03

Efficient integration of pose estimation into detection framework.

Abstract

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image. Motivated by the deep learning based object detection methods, we propose a concise and efficient network that integrate 6D object pose parameter estimation into the object detection framework. Furthermore, for more robust estimation to occlusion, a non-local self-attention module is introduced. The experimental results show that the proposed method reaches the state-of-the-art performance on the YCB-video and the Linemod datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Robot Manipulation and Learning