Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection
Kangqi Ma, Hao Dong, Yadong Mu

TL;DR
This paper introduces a local occupancy prediction approach using a multi-triplanar scheme to improve 3D object shape understanding for robotic grasping in cluttered scenes, leading to higher grasp success rates.
Contribution
It proposes a novel local occupancy prediction method with a multi-triplanar scheme to enhance shape completion and grasp pose estimation in cluttered environments.
Findings
Outperforms existing methods on GraspNet-1Billion benchmark
Effectively completes unobserved object parts in occluded scenes
Achieves higher grasping average precision
Abstract
This paper addresses the challenge of robotic grasping of general objects. Similar to prior research, the task reads a single-view 3D observation (i.e., point clouds) captured by a depth camera as input. Crucially, the success of object grasping highly demands a comprehensive understanding of the shape of objects within the scene. However, single-view observations often suffer from occlusions (including both self and inter-object occlusions), which lead to gaps in the point clouds, especially in complex cluttered scenes. This renders incomplete perception of the object shape and frequently causes failures or inaccurate pose estimation during object grasping. In this paper, we tackle this issue with an effective albeit simple solution, namely completing grasping-related scene regions through local occupancy prediction. Following prior practice, the proposed model first runs by proposing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Hand Gesture Recognition Systems
