6D Object Pose Estimation without PnP
Jin Liu, Sheng He

TL;DR
This paper presents an end-to-end RGB-based 6D object pose estimation method that avoids PnP, achieving high accuracy and efficiency through a novel network architecture and rotation representation.
Contribution
The paper introduces a PnP-free, fully convolutional network for 6D pose estimation with a novel rotation representation and a collinear equation layer for direct pose regression.
Findings
Achieves less than 18 ms/object inference time on GPU.
Translational error less than 1.67 cm.
Rotational error less than 2.5 degrees.
Abstract
In this paper, we propose an efficient end-to-end algorithm to tackle the problem of estimating the 6D pose of objects from a single RGB image. Our system trains a fully convolutional network to regress the 3D rotation and the 3D translation in region layer. On this basis, a special layer, Collinear Equation Layer, is added next to region layer to output the 2D projections of the 3D bounding boxs corners. In the back propagation stage, the 6D pose network are adjusted according to the error of the 2D projections. In the detection phase, we directly output the position and pose through the region layer. Besides, we introduce a novel and concise representation of 3D rotation to make the regression more precise and easier. Experiments show that our method outperforms base-line and state of the art methods both at accuracy and efficiency. In the LineMod dataset, our algorithm achieves less…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Image and Object Detection Techniques · Robot Manipulation and Learning
