A4T: Hierarchical Affordance Detection for Transparent Objects Depth Reconstruction and Manipulation
Jiaqi Jiang, Guanqun Cao, Thanh-Toan Do, Shan Luo

TL;DR
This paper introduces A4T, a hierarchical framework for detecting affordances and reconstructing depth maps of transparent objects, significantly improving accuracy and enabling effective robotic manipulation.
Contribution
The paper presents a novel hierarchical affordance detection framework and a multi-step depth reconstruction method tailored for transparent objects, along with a new dataset for evaluation.
Findings
Significantly reduced depth reconstruction error from 0.097 to 0.042 RMSE.
Accurate affordance maps enable effective manipulation of transparent objects.
Demonstrated successful robotic manipulation experiments.
Abstract
Transparent objects are widely used in our daily lives and therefore robots need to be able to handle them. However, transparent objects suffer from light reflection and refraction, which makes it challenging to obtain the accurate depth maps required to perform handling tasks. In this paper, we propose a novel affordance-based framework for depth reconstruction and manipulation of transparent objects, named A4T. A hierarchical AffordanceNet is first used to detect the transparent objects and their associated affordances that encode the relative positions of an object's different parts. Then, given the predicted affordance map, a multi-step depth reconstruction method is used to progressively reconstruct the depth maps of transparent objects. Finally, the reconstructed depth maps are employed for the affordance-based manipulation of transparent objects. To evaluate our proposed method,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
