Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge
Andy Zeng, Kuan-Ting Yu, Shuran Song, Daniel Suo, Ed Walker Jr.,, Alberto Rodriguez, Jianxiong Xiao

TL;DR
This paper presents a multi-view self-supervised deep learning approach for 6D object pose estimation in cluttered warehouse environments, demonstrating robustness and accuracy in challenging scenarios, and achieving competitive results in the Amazon Picking Challenge.
Contribution
The paper introduces a self-supervised data generation method for training deep neural networks for 6D pose estimation, reducing manual labeling effort and improving robustness in cluttered scenes.
Findings
Achieved 3rd and 4th place in APC 2016 for stowing and picking tasks.
Demonstrated reliable 6D pose estimation in cluttered, occluded environments.
Provided open-source code, data, and benchmarks for further research.
Abstract
Robot warehouse automation has attracted significant interest in recent years, perhaps most visibly in the Amazon Picking Challenge (APC). A fully autonomous warehouse pick-and-place system requires robust vision that reliably recognizes and locates objects amid cluttered environments, self-occlusions, sensor noise, and a large variety of objects. In this paper we present an approach that leverages multi-view RGB-D data and self-supervised, data-driven learning to overcome those difficulties. The approach was part of the MIT-Princeton Team system that took 3rd- and 4th- place in the stowing and picking tasks, respectively at APC 2016. In the proposed approach, we segment and label multiple views of a scene with a fully convolutional neural network, and then fit pre-scanned 3D object models to the resulting segmentation to get the 6D object pose. Training a deep neural network for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robot Manipulation and Learning · Advanced Neural Network Applications
