SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping

Mingxu Zhang; Xiaoqi Li; Jiahui Xu; Kaichen Zhou; Hojin Bae; Yan Shen; Chuyan Xiong; Hao Dong

arXiv:2505.24305·cs.RO·June 23, 2025

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping

Mingxu Zhang, Xiaoqi Li, Jiahui Xu, Kaichen Zhou, Hojin Bae, Yan Shen, Chuyan Xiong, Hao Dong

PDF

Open Access

TL;DR

SR3D is a training-free framework that enables robotic grasping of transparent and specular objects from a single view by reconstructing accurate 3D models and localizing objects within scenes using view and keypoint matching.

Contribution

It introduces a novel single-view 3D reconstruction method that does not require training, specifically designed for transparent and specular objects in robotic grasping.

Findings

01

Effective 3D reconstruction in simulation and real-world scenarios

02

Improved grasp detection accuracy for challenging materials

03

Utilizes view and keypoint matching for precise object localization

Abstract

Recent advancements in 3D robotic manipulation have improved grasping of everyday objects, but transparent and specular materials remain challenging due to depth sensing limitations. While several 3D reconstruction and depth completion approaches address these challenges, they suffer from setup complexity or limited observation information utilization. To address this, leveraging the power of single view 3D object reconstruction approaches, we propose a training free framework SR3D that enables robotic grasping of transparent and specular objects from a single view observation. Specifically, given single view RGB and depth images, SR3D first uses the external visual models to generate 3D reconstructed object mesh based on RGB image. Then, the key idea is to determine the 3D object's pose and scale to accurately localize the reconstructed object back into its original depth corrupted 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Hand Gesture Recognition Systems