KGN-Pro: Keypoint-Based Grasp Prediction through Probabilistic 2D-3D Correspondence Learning

Bingran Chen; Baorun Li; Jian Yang; Yong Liu; and Guangyao Zhai

arXiv:2507.14820·cs.RO·August 1, 2025

KGN-Pro: Keypoint-Based Grasp Prediction through Probabilistic 2D-3D Correspondence Learning

Bingran Chen, Baorun Li, Jian Yang, Yong Liu, and Guangyao Zhai

PDF

Open Access

TL;DR

KGN-Pro introduces a probabilistic 3D correspondence learning framework for robotic grasp prediction that leverages RGB-D data and end-to-end training to improve accuracy and robustness over previous methods.

Contribution

It presents a novel probabilistic PnP layer integrated into a keypoint-based grasp network, enabling end-to-end 3D supervision and improved grasp estimation.

Findings

01

Outperforms existing methods in grasp success rate

02

Achieves higher grasp cover rate in experiments

03

Effective in both simulated and real-world scenarios

Abstract

High-level robotic manipulation tasks demand flexible 6-DoF grasp estimation to serve as a basic function. Previous approaches either directly generate grasps from point-cloud data, suffering from challenges with small objects and sensor noise, or infer 3D information from RGB images, which introduces expensive annotation requirements and discretization issues. Recent methods mitigate some challenges by retaining a 2D representation to estimate grasp keypoints and applying Perspective-n-Point (PnP) algorithms to compute 6-DoF poses. However, these methods are limited by their non-differentiable nature and reliance solely on 2D supervision, which hinders the full exploitation of rich 3D information. In this work, we present KGN-Pro, a novel grasping network that preserves the efficiency and fine-grained object grasping of previous KGNs while integrating direct 3D optimization through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Motor Control and Adaptation · Interactive and Immersive Displays