RGB Matters: Learning 7-DoF Grasp Poses on Monocular RGBD Images
Minghao Gou, Hao-Shu Fang, Zhanda Zhu, Sheng Xu, Chenxi Wang, Cewu Lu

TL;DR
This paper introduces RGBD-Grasp, a novel pipeline that decouples 7-DoF grasp detection into separate RGB and depth processing steps, improving robustness and success rates in robotic grasping tasks.
Contribution
It proposes a new 7-DoF grasp detection method using separate RGB and depth processing, with a novel Angle-View Net and Fast Analytic Searching modules.
Findings
Achieves state-of-the-art results on GraspNet-1Billion dataset.
Demonstrates high success rates in real robot experiments.
Robust to depth sensor noise and low-quality depth images.
Abstract
General object grasping is an important yet unsolved problem in the field of robotics. Most of the current methods either generate grasp poses with few DoF that fail to cover most of the success grasps, or only take the unstable depth image or point cloud as input which may lead to poor results in some cases. In this paper, we propose RGBD-Grasp, a pipeline that solves this problem by decoupling 7-DoF grasp detection into two sub-tasks where RGB and depth information are processed separately. In the first stage, an encoder-decoder like convolutional neural network Angle-View Net(AVN) is proposed to predict the SO(3) orientation of the gripper at every location of the image. Consequently, a Fast Analytic Searching(FAS) module calculates the opening width and the distance of the gripper to the grasp point. By decoupling the grasp detection problem and introducing the stable RGB modality,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Soft Robotics and Applications · Hand Gesture Recognition Systems
