TL;DR
Contact-GraspNet is an end-to-end neural network that efficiently generates 6-DoF grasps directly from depth data, significantly improving success rates in cluttered scenes compared to previous methods.
Contribution
It introduces a novel 4-DoF grasp representation rooted in point cloud data, enabling efficient learning and generalization for robotic grasping in cluttered environments.
Findings
Achieves over 90% success rate in robotic grasping of unseen objects.
Reduces failure rate by half compared to recent state-of-the-art methods.
Trained on 17 million simulated grasps, generalizes well to real-world data.
Abstract
Grasping unseen objects in unconstrained, cluttered environments is an essential skill for autonomous robotic manipulation. Despite recent progress in full 6-DoF grasp learning, existing approaches often consist of complex sequential pipelines that possess several potential failure points and run-times unsuitable for closed-loop grasping. Therefore, we propose an end-to-end network that efficiently generates a distribution of 6-DoF parallel-jaw grasps directly from a depth recording of a scene. Our novel grasp representation treats 3D points of the recorded point cloud as potential grasp contacts. By rooting the full 6-DoF grasp pose and width in the observed point cloud, we can reduce the dimensionality of our grasp representation to 4-DoF which greatly facilitates the learning process. Our class-agnostic approach is trained on 17 million simulated grasps and generalizes well to real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
