Visual-Geometric Collaborative Guidance for Affordance Learning
Hongchen Luo, Wei Zhai, Jiao Wang, Yang Cao, Zheng-Jun Zha

TL;DR
This paper introduces a novel visual-geometric collaborative approach for affordance learning that leverages interactive affinity from human-object interactions, improving the recognition of action possibilities in diverse environments.
Contribution
It proposes a new network that combines visual and geometric cues to extract interactive affinity, along with a large contact-driven affordance dataset for training and evaluation.
Findings
Outperforms existing models on objective metrics
Achieves better visual quality in affordance recognition
Demonstrates robustness in diverse environments
Abstract
Perceiving potential ``action possibilities'' (\ie, affordance) regions of images and learning interactive functionalities of objects from human demonstration is a challenging task due to the diversity of human-object interactions. Prevailing affordance learning algorithms often adopt the label assignment paradigm and presume that there is a unique relationship between functional region and affordance label, yielding poor performance when adapting to unseen environments with large appearance variations. In this paper, we propose to leverage interactive affinity for affordance learning, \ie extracting interactive affinity from human-object interaction and transferring it to non-interactive objects. Interactive affinity, which represents the contacts between different parts of the human body and local regions of the target object, can provide inherent cues of interconnectivity between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
