One-Shot Object Affordance Detection in the Wild
Wei Zhai, Hongchen Luo, Jing Zhang, Yang Cao, Dacheng Tao

TL;DR
This paper introduces OSAD-Net, a novel one-shot affordance detection network that estimates human action purpose to detect common object affordances in unseen scenes, supported by a large-scale annotated dataset PADv2.
Contribution
The paper proposes a new one-shot affordance detection method, OSAD-Net, and provides a large-scale dataset PADv2 for benchmarking and advancing affordance detection research.
Findings
OSAD-Net outperforms previous models on PADv2 in accuracy and visual quality.
The PADv2 dataset contains 30k images with rich annotations for diverse affordances.
Experimental results validate the effectiveness of the proposed approach.
Abstract
Affordance detection refers to identifying the potential action possibilities of objects in an image, which is a crucial ability for robot perception and manipulation. To empower robots with this ability in unseen scenarios, we first study the challenging one-shot affordance detection problem in this paper, i.e., given a support image that depicts the action purpose, all objects in a scene with the common affordance should be detected. To this end, we devise a One-Shot Affordance Detection Network (OSAD-Net) that firstly estimates the human action purpose and then transfers it to help detect the common affordance from all candidate images. Through collaboration learning, OSAD-Net can capture the common characteristics between objects having the same underlying affordance and learn a good adaptation capability for perceiving unseen affordances. Besides, we build a large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
