Depth by Poking: Learning to Estimate Depth from Self-Supervised   Grasping

Ben Goodrich; Alex Kuefler; William D. Richards

arXiv:2006.08903·cs.CV·June 17, 2020

Depth by Poking: Learning to Estimate Depth from Self-Supervised Grasping

Ben Goodrich, Alex Kuefler, William D. Richards

PDF

Open Access

TL;DR

This paper introduces a self-supervised neural network that estimates depth from RGB-D images using robot interactions, improving depth accuracy especially on reflective or transparent surfaces without additional labeling.

Contribution

It presents a novel self-supervised approach for depth estimation from physical robot interactions, eliminating the need for human-labeled data and enhancing performance on challenging surfaces.

Findings

01

Lower RMSE than structured light sensors

02

Outperforms unsupervised deep learning methods

03

Effective on industry-scale jumbled bin datasets

Abstract

Accurate depth estimation remains an open problem for robotic manipulation; even state of the art techniques including structured light and LiDAR sensors fail on reflective or transparent surfaces. We address this problem by training a neural network model to estimate depth from RGB-D images, using labels from physical interactions between a robot and its environment. Our network predicts, for each pixel in an input image, the z position that a robot's end effector would reach if it attempted to grasp or poke at the corresponding position. Given an autonomous grasping policy, our approach is self-supervised as end effector position labels can be recovered through forward kinematics, without human annotation. Although gathering such physical interaction data is expensive, it is necessary for training and routine operation of state of the art manipulation systems. Therefore, this depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Advanced Vision and Imaging · Robotics and Sensor-Based Localization