Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching
Naoki Wake, Daichi Saito, Kazuhiro Sasabuchi, Hideki Koike, Katsushi, Ikeuchi

TL;DR
This paper explores how text-driven object affordance can enhance image-based grasp-type recognition in robot teaching, especially under limited visual information, by filtering and emphasizing likely grasp types based on object knowledge.
Contribution
It introduces a method leveraging text-driven object affordance to improve grasp recognition accuracy in multimodal robot teaching scenarios.
Findings
Object affordance improves grasp recognition by filtering unlikely types.
The effect is stronger with objects having a clear grasp bias.
Object affordance benefits recognition even with limited visual info.
Abstract
This study investigates how text-driven object affordance, which provides prior knowledge about grasp types for each object, affects image-based grasp-type recognition in robot teaching. The researchers created labeled datasets of first-person hand images to examine the impact of object affordance on recognition performance. They evaluated scenarios with real and illusory objects, considering mixed reality teaching conditions where visual object information may be limited. The results demonstrate that object affordance improves image-based recognition by filtering out unlikely grasp types and emphasizing likely ones. The effectiveness of object affordance was more pronounced when there was a stronger bias towards specific grasp types for each object. These findings highlight the significance of object affordance in multimodal robot teaching, regardless of whether real objects are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
