GraspME -- Grasp Manifold Estimator
Janik Hager, Ruben Bauer, Marc Toussaint, Jim Mainprice

TL;DR
This paper presents GraspME, a novel method for estimating grasp manifolds directly from 2D images, enabling robots to identify multiple grasp options for objects using deep learning, with high accuracy and real-time performance.
Contribution
Introducing a grasp manifold estimator that uses Mask R-CNN and learned features to detect grasp affordances in 2D images, generalizing to unseen objects and different viewpoints.
Findings
Achieves 11.5 fps inference speed on GPU.
94.5% average precision for keypoint estimation.
Mean pixel distance of 1.29 for keypoints.
Abstract
In this paper, we introduce a Grasp Manifold Estimator (GraspME) to detect grasp affordances for objects directly in 2D camera images. To perform manipulation tasks autonomously it is crucial for robots to have such graspability models of the surrounding objects. Grasp manifolds have the advantage of providing continuously infinitely many grasps, which is not the case when using other grasp representations such as predefined grasp points. For instance, this property can be leveraged in motion optimization to define goal sets as implicit surface constraints in the robot configuration space. In this work, we restrict ourselves to the case of estimating possible end-effector positions directly from 2D camera images. To this extend, we define grasp manifolds via a set of key points and locate them in images using a Mask R-CNN backbone. Using learned features allows generalizing to different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network · Convolution · Softmax · RoIAlign · Mask R-CNN
