Visual-Geometric Collaborative Guidance for Affordance Learning

Hongchen Luo; Wei Zhai; Jiao Wang; Yang Cao; Zheng-Jun Zha

arXiv:2410.11363·cs.CV·October 16, 2024

Visual-Geometric Collaborative Guidance for Affordance Learning

Hongchen Luo, Wei Zhai, Jiao Wang, Yang Cao, Zheng-Jun Zha

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel visual-geometric collaborative approach for affordance learning that leverages interactive affinity from human-object interactions, improving the recognition of action possibilities in diverse environments.

Contribution

It proposes a new network that combines visual and geometric cues to extract interactive affinity, along with a large contact-driven affordance dataset for training and evaluation.

Findings

01

Outperforms existing models on objective metrics

02

Achieves better visual quality in affordance recognition

03

Demonstrates robustness in diverse environments

Abstract

Perceiving potential ``action possibilities'' (\ie, affordance) regions of images and learning interactive functionalities of objects from human demonstration is a challenging task due to the diversity of human-object interactions. Prevailing affordance learning algorithms often adopt the label assignment paradigm and presume that there is a unique relationship between functional region and affordance label, yielding poor performance when adapting to unseen environments with large appearance variations. In this paper, we propose to leverage interactive affinity for affordance learning, \ie extracting interactive affinity from human-object interaction and transferring it to non-interactive objects. Interactive affinity, which represents the contacts between different parts of the human body and local regions of the target object, can provide inherent cues of interconnectivity between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lhc1224/vcr-net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning