RFTrans: Leveraging Refractive Flow of Transparent Objects for Surface Normal Estimation and Manipulation
Tutian Tang, Jiyu Liu, Jieyi Zhang, Haoyuan Fu, Wenqiang Xu, Cewu Lu

TL;DR
RFTrans introduces a novel RGB-D-based approach leveraging refractive flow to improve surface normal estimation and manipulation of transparent objects, effectively bridging the sim-to-real gap and achieving high grasp success rates.
Contribution
The paper presents RFTrans, a new method that uses refractive flow as an intermediate representation for better geometry estimation of transparent objects, enabling effective robotic manipulation.
Findings
Outperforms baseline in synthetic and real-world tests
Achieves 83% success rate in real robot grasping
Uses synthetic data for training with successful sim-to-real transfer
Abstract
Transparent objects are widely used in our daily lives, making it important to teach robots to interact with them. However, it's not easy because the reflective and refractive effects can make depth cameras fail to give accurate geometry measurements. To solve this problem, this paper introduces RFTrans, an RGB-D-based method for surface normal estimation and manipulation of transparent objects. By leveraging refractive flow as an intermediate representation, the proposed method circumvents the drawbacks of directly predicting the geometry (e.g. surface normal) from images and helps bridge the sim-to-real gap. It integrates the RFNet, which predicts refractive flow, object mask, and boundaries, followed by the F2Net, which estimates surface normal from the refractive flow. To make manipulation possible, a global optimization module will take in the predictions, refine the raw depth, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robot Manipulation and Learning · Human Pose and Action Recognition
