Selective Contrastive Learning for Weakly Supervised Affordance Grounding

WonJun Moon; Hyun Seok Seong; Jae-Pil Heo

arXiv:2508.07877·cs.CV·August 12, 2025

Selective Contrastive Learning for Weakly Supervised Affordance Grounding

WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

PDF

Open Access 1 Models

TL;DR

This paper introduces a novel selective contrastive learning approach for weakly supervised affordance grounding, leveraging cross-view object discovery and contrastive objectives to improve localization of functional parts without pixel-level annotations.

Contribution

It proposes a new method that adaptively learns affordance-relevant cues at both part and object levels using contrastive objectives and cross-view object discovery, advancing weakly supervised affordance grounding.

Findings

01

Effective localization of affordance-relevant regions demonstrated.

02

Improved performance over existing weakly supervised methods.

03

Cross-view object discovery enhances part-level affordance cues.

Abstract

Facilitating an entity's interaction with objects requires accurately identifying parts that afford specific actions. Weakly supervised affordance grounding (WSAG) seeks to imitate human learning from third-person demonstrations, where humans intuitively grasp functional parts without needing pixel-level annotations. To achieve this, grounding is typically learned using a shared classifier across images from different perspectives, along with distillation strategies incorporating part discovery process. However, since affordance-relevant parts are not always easily distinguishable, models primarily rely on classification, often focusing on common class-specific patterns that are unrelated to affordance. To address this limitation, we move beyond isolated part-level learning by introducing selective prototypical and pixel contrastive objectives that adaptively learn affordance-relevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
hynnsk/SelectiveCL
model· ♡ 3
♡ 3

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics