Efficient and Robust Training of Dense Object Nets for Multi-Object Robot Manipulation
David B. Adrian, Andras Gabor Kupcsik, Markus Spies, Heiko Neumann

TL;DR
This paper introduces a simplified and improved training framework for Dense Object Nets that enhances multi-object manipulation capabilities, robustness, and efficiency, reducing data needs and hyperparameter sensitivity.
Contribution
It presents a new training regime with multi-object data, an alternative loss function, and augmentation strategies that outperform previous methods in robustness and accuracy.
Findings
Higher precision in object tracking
Robustness in real-world grasping tasks
Reduced data and hyperparameter sensitivity
Abstract
We propose a framework for robust and efficient training of Dense Object Nets (DON) with a focus on multi-object robot manipulation scenarios. DON is a popular approach to obtain dense, view-invariant object descriptors, which can be used for a multitude of downstream tasks in robot manipulation, such as, pose estimation, state representation for control, etc.. However, the original work focused training on singulated objects, with limited results on instance-specific, multi-object applications. Additionally, a complex data collection pipeline, including 3D reconstruction and mask annotation of each object, is required for training. In this paper, we further improve the efficacy of DON with a simplified data collection and training regime, that consistently yields higher precision and enables robust tracking of keypoints with less data requirements. In particular, we focus on training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Human Pose and Action Recognition · Advanced Vision and Imaging
