End-to-End Dexterous Grasp Learning from Single-View Point Clouds via a Multi-Object Scene Dataset

Tao Geng; Dapeng Yang; Ziwei Liu; Le Zhang; Le Qi; WangYang Li; Yi Ren; Shan Luo; Fenglei Ni

arXiv:2603.15410·cs.RO·March 17, 2026

End-to-End Dexterous Grasp Learning from Single-View Point Clouds via a Multi-Object Scene Dataset

Tao Geng, Dapeng Yang, Ziwei Liu, Le Zhang, Le Qi, WangYang Li, Yi Ren, Shan Luo, Fenglei Ni

PDF

Open Access

TL;DR

This paper introduces DGS-Net, an end-to-end network for dexterous grasp prediction from single-view point clouds in multi-object scenes, supported by a large dataset and a two-stage data generation strategy, achieving high success rates.

Contribution

The paper presents a novel end-to-end grasp prediction network and a comprehensive multi-object scene dataset, addressing limitations of existing datasets and improving grasping robustness and generalization.

Findings

01

Achieves 88.63% grasp success in simulation

02

Attains 78.98% success on real robot platform

03

Demonstrates lower penetration and better generalization

Abstract

Dexterous grasping in multi-object scene constitutes a fundamental challenge in robotic manipulation. Current mainstream grasping datasets predominantly focus on single-object scenarios and predefined grasp configurations, often neglecting environmental interference and the modeling of dexterous pre-grasp gesture, thereby limiting their generalizability in real-world applications. To address this, we propose DGS-Net, an end-to-end grasp prediction network capable of learning dense grasp configurations from single-view point clouds in multi-object scene. Furthermore, we propose a two-stage grasp data generation strategy that progresses from dense single-object grasp synthesis to dense scene-level grasp generation. Our dataset comprises 307 objects, 240 multi-object scenes, and over 350k validated grasps. By explicitly modeling grasp offsets and pre-grasp configurations, the dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Motor Control and Adaptation · Human Pose and Action Recognition