One-Shot Open Affordance Learning with Foundation Models

Gen Li; Deqing Sun; Laura Sevilla-Lara; Varun Jampani

arXiv:2311.17776·cs.CV·November 30, 2023·1 cites

One-Shot Open Affordance Learning with Foundation Models

Gen Li, Deqing Sun, Laura Sevilla-Lara, Varun Jampani

PDF

Open Access

TL;DR

This paper presents a one-shot learning framework for affordance detection using foundation models, achieving high performance with minimal data and demonstrating strong generalization to unseen objects and affordances.

Contribution

It introduces a novel one-shot affordance learning method leveraging vision-language models, enhancing data efficiency and generalization in affordance segmentation.

Findings

01

Outperforms state-of-the-art models with less than 1% training data

02

Shows strong generalization to unseen objects and affordances

03

Effective alignment of visual features and affordance text embeddings

Abstract

We introduce One-shot Open Affordance Learning (OOAL), where a model is trained with just one example per base object category, but is expected to identify novel objects and affordances. While vision-language models excel at recognizing novel objects and scenes, they often struggle to understand finer levels of granularity such as affordances. To handle this issue, we conduct a comprehensive analysis of existing foundation models, to explore their inherent understanding of affordances and assess the potential for data-limited affordance learning. We then propose a vision-language framework with simple and effective designs that boost the alignment between visual features and affordance text embeddings. Experiments on two affordance segmentation benchmarks show that the proposed method outperforms state-of-the-art models with less than 1% of the full training data, and exhibits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Robot Manipulation and Learning

MethodsBalanced Selection