Multi-Modal Geometric Learning for Grasping and Manipulation

David Watkins; Jacob Varley; Peter Allen

arXiv:1803.07671·cs.RO·February 13, 2023

TL;DR

This paper introduces a neural network architecture that combines depth and tactile data to improve 3D object modeling and manipulation in robotics, especially under occlusion and partial views.

Contribution

The work presents a novel multi-modal 3D CNN that integrates tactile and depth information for enhanced geometric reasoning in robotic grasping.

Findings

01 Even small amounts of tactile data improve geometric predictions, particularly when depth alone is not enough.

02 The method outperforms existing visual-tactile approaches.

03 Combining depth and tactile data improves grasp success rates.

Abstract

This work provides an architecture that incorporates depth and tactile information to create rich and accurate 3D models useful for robotic manipulation tasks. This is accomplished through the use of a 3D convolutional neural network (CNN). Offline, the network is provided with both depth and tactile information and trained to predict the object's geometry, thus filling in regions of occlusion. At runtime, the network is provided a partial view of an object. Tactile information is acquired to augment the captured depth information. The network can then reason about the object's geometry by utilizing both the collected tactile and depth information. We demonstrate that even small amounts of additional tactile information can be incredibly helpful in reasoning about object geometry. This is particularly true when information from depth alone fails to produce an accurate geometric…
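As a rough illustration of how depth and tactile observations could be presented to a 3D CNN, the sketch below voxelizes both point sets into a shared occupancy grid. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `voxelize` and `build_network_input`, the default grid resolution, and the choice to keep depth and tactile occupancy in separate channels are all hypothetical.

```python
import numpy as np


def voxelize(points, grid_min, voxel_size, resolution):
    """Mark every voxel that contains at least one point as occupied."""
    grid = np.zeros((resolution,) * 3, dtype=bool)
    points = np.asarray(points, dtype=float).reshape(-1, 3)
    # Map metric coordinates to integer voxel indices.
    idx = np.floor((points - grid_min) / voxel_size).astype(int)
    # Discard points that fall outside the grid bounds.
    inside = np.all((idx >= 0) & (idx < resolution), axis=1)
    idx = idx[inside]
    grid[idx[:, 0], idx[:, 1], idx[:, 2]] = True
    return grid


def build_network_input(depth_points, tactile_points, grid_min,
                        voxel_size, resolution=40):
    """Stack depth and tactile occupancy into a (2, R, R, R) float array.

    Assumption: two separate channels. Merging both point sets into a
    single occupancy channel would be an equally plausible encoding.
    """
    depth_grid = voxelize(depth_points, grid_min, voxel_size, resolution)
    tactile_grid = voxelize(tactile_points, grid_min, voxel_size, resolution)
    return np.stack([depth_grid, tactile_grid]).astype(np.float32)


# Toy example: one depth point, two tactile contacts (one out of bounds).
example = build_network_input(
    depth_points=[[0.05, 0.05, 0.05]],
    tactile_points=[[0.35, 0.05, 0.05], [10.0, 10.0, 10.0]],
    grid_min=np.zeros(3),
    voxel_size=0.1,
    resolution=4,
)
```

Both sensors share one coordinate frame and one grid here, so a tactile contact on the occluded side of an object lands in voxels that the depth channel leaves empty. That is the kind of complementary evidence the abstract describes.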

Code & Models

Repositories

CRLab/visualtactilegrasping

Taxonomy

Topics: Robot Manipulation and Learning · Tactile and Sensory Interactions · Human Pose and Action Recognition