KGN-Pro: Keypoint-Based Grasp Prediction through Probabilistic 2D-3D Correspondence Learning
Bingran Chen, Baorun Li, Jian Yang, Yong Liu, and Guangyao Zhai

TL;DR
KGN-Pro introduces a probabilistic 3D correspondence learning framework for robotic grasp prediction that leverages RGB-D data and end-to-end training to improve accuracy and robustness over previous methods.
Contribution
It presents a novel probabilistic PnP layer integrated into a keypoint-based grasp network, enabling end-to-end 3D supervision and improved grasp estimation.
Findings
Outperforms existing methods in grasp success rate
Achieves higher grasp cover rate in experiments
Effective in both simulated and real-world scenarios
Abstract
High-level robotic manipulation tasks demand flexible 6-DoF grasp estimation to serve as a basic function. Previous approaches either directly generate grasps from point-cloud data, suffering from challenges with small objects and sensor noise, or infer 3D information from RGB images, which introduces expensive annotation requirements and discretization issues. Recent methods mitigate some challenges by retaining a 2D representation to estimate grasp keypoints and applying Perspective-n-Point (PnP) algorithms to compute 6-DoF poses. However, these methods are limited by their non-differentiable nature and reliance solely on 2D supervision, which hinders the full exploitation of rich 3D information. In this work, we present KGN-Pro, a novel grasping network that preserves the efficiency and fine-grained object grasping of previous KGNs while integrating direct 3D optimization through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Motor Control and Adaptation · Interactive and Immersive Displays
