TL;DR
This paper introduces a new dataset and benchmark for 6-DoF pose estimation of household objects, specifically designed for robotic manipulation, including textured models, challenging scenes, and a novel evaluation metric.
Contribution
It provides a readily accessible dataset with synthetic and real images, ground truth annotations, pre-trained models, and a new symmetry-robust pose evaluation metric.
Findings
Pre-trained pose estimators achieve baseline performance on the dataset.
The ADD-H metric effectively handles symmetric objects in pose evaluation.
The dataset facilitates research bridging computer vision and robotics.
Abstract
We present a new dataset for 6-DoF pose estimation of known objects, with a focus on robotic manipulation research. We propose a set of toy grocery objects, whose physical instantiations are readily available for purchase and are appropriately sized for robotic grasping and manipulation. We provide 3D scanned textured models of these objects, suitable for generating synthetic training data, as well as RGBD images of the objects in challenging, cluttered scenes exhibiting partial occlusion, extreme lighting variations, multiple instances per image, and a large variety of poses. Using semi-automated RGBD-to-model texture correspondences, the images are annotated with ground truth poses accurate within a few millimeters. We also propose a new pose evaluation metric called ADD-H based on the Hungarian assignment algorithm that is robust to symmetries in object geometry without requiring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
