Efficient and Robust Training of Dense Object Nets for Multi-Object   Robot Manipulation

David B. Adrian; Andras Gabor Kupcsik; Markus Spies; Heiko Neumann

arXiv:2206.12145·cs.RO·June 27, 2022

Efficient and Robust Training of Dense Object Nets for Multi-Object Robot Manipulation

David B. Adrian, Andras Gabor Kupcsik, Markus Spies, Heiko Neumann

PDF

Open Access

TL;DR

This paper introduces a simplified and improved training framework for Dense Object Nets that enhances multi-object manipulation capabilities, robustness, and efficiency, reducing data needs and hyperparameter sensitivity.

Contribution

It presents a new training regime with multi-object data, an alternative loss function, and augmentation strategies that outperform previous methods in robustness and accuracy.

Findings

01

Higher precision in object tracking

02

Robustness in real-world grasping tasks

03

Reduced data and hyperparameter sensitivity

Abstract

We propose a framework for robust and efficient training of Dense Object Nets (DON) with a focus on multi-object robot manipulation scenarios. DON is a popular approach to obtain dense, view-invariant object descriptors, which can be used for a multitude of downstream tasks in robot manipulation, such as, pose estimation, state representation for control, etc.. However, the original work focused training on singulated objects, with limited results on instance-specific, multi-object applications. Additionally, a complex data collection pipeline, including 3D reconstruction and mask annotation of each object, is required for training. In this paper, we further improve the efficacy of DON with a simplified data collection and training regime, that consistently yields higher precision and enables robust tracking of keypoints with less data requirements. In particular, we focus on training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Human Pose and Action Recognition · Advanced Vision and Imaging