Text-driven object affordance for guiding grasp-type recognition in   multimodal robot teaching

Naoki Wake; Daichi Saito; Kazuhiro Sasabuchi; Hideki Koike; Katsushi; Ikeuchi

arXiv:2103.00268·cs.RO·June 7, 2023

Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching

Naoki Wake, Daichi Saito, Kazuhiro Sasabuchi, Hideki Koike, Katsushi, Ikeuchi

PDF

Open Access 1 Repo

TL;DR

This paper explores how text-driven object affordance can enhance image-based grasp-type recognition in robot teaching, especially under limited visual information, by filtering and emphasizing likely grasp types based on object knowledge.

Contribution

It introduces a method leveraging text-driven object affordance to improve grasp recognition accuracy in multimodal robot teaching scenarios.

Findings

01

Object affordance improves grasp recognition by filtering unlikely types.

02

The effect is stronger with objects having a clear grasp bias.

03

Object affordance benefits recognition even with limited visual info.

Abstract

This study investigates how text-driven object affordance, which provides prior knowledge about grasp types for each object, affects image-based grasp-type recognition in robot teaching. The researchers created labeled datasets of first-person hand images to examine the impact of object affordance on recognition performance. They evaluated scenarios with real and illusory objects, considering mixed reality teaching conditions where visual object information may be limited. The results demonstrate that object affordance improves image-based recognition by filtering out unlikely grasp types and emphasizing likely ones. The effectiveness of object affordance was more pronounced when there was a stronger bias towards specific grasp types for each object. These findings highlight the significance of object affordance in multimodal robot teaching, regardless of whether real objects are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/arr-grasp-type-recognition
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning