RFTrans: Leveraging Refractive Flow of Transparent Objects for Surface   Normal Estimation and Manipulation

Tutian Tang; Jiyu Liu; Jieyi Zhang; Haoyuan Fu; Wenqiang Xu; Cewu Lu

arXiv:2311.12398·cs.CV·February 20, 2024·1 cites

RFTrans: Leveraging Refractive Flow of Transparent Objects for Surface Normal Estimation and Manipulation

Tutian Tang, Jiyu Liu, Jieyi Zhang, Haoyuan Fu, Wenqiang Xu, Cewu Lu

PDF

Open Access

TL;DR

RFTrans introduces a novel RGB-D-based approach leveraging refractive flow to improve surface normal estimation and manipulation of transparent objects, effectively bridging the sim-to-real gap and achieving high grasp success rates.

Contribution

The paper presents RFTrans, a new method that uses refractive flow as an intermediate representation for better geometry estimation of transparent objects, enabling effective robotic manipulation.

Findings

01

Outperforms baseline in synthetic and real-world tests

02

Achieves 83% success rate in real robot grasping

03

Uses synthetic data for training with successful sim-to-real transfer

Abstract

Transparent objects are widely used in our daily lives, making it important to teach robots to interact with them. However, it's not easy because the reflective and refractive effects can make depth cameras fail to give accurate geometry measurements. To solve this problem, this paper introduces RFTrans, an RGB-D-based method for surface normal estimation and manipulation of transparent objects. By leveraging refractive flow as an intermediate representation, the proposed method circumvents the drawbacks of directly predicting the geometry (e.g. surface normal) from images and helps bridge the sim-to-real gap. It integrates the RFNet, which predicts refractive flow, object mask, and boundaries, followed by the F2Net, which estimates surface normal from the refractive flow. To make manipulation possible, a global optimization module will take in the predictions, refine the raw depth, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robot Manipulation and Learning · Human Pose and Action Recognition