Selective Contrastive Learning for Weakly Supervised Affordance Grounding
WonJun Moon, Hyun Seok Seong, Jae-Pil Heo

TL;DR
This paper introduces a novel selective contrastive learning approach for weakly supervised affordance grounding, leveraging cross-view object discovery and contrastive objectives to improve localization of functional parts without pixel-level annotations.
Contribution
It proposes a new method that adaptively learns affordance-relevant cues at both part and object levels using contrastive objectives and cross-view object discovery, advancing weakly supervised affordance grounding.
Findings
Effective localization of affordance-relevant regions demonstrated.
Improved performance over existing weakly supervised methods.
Cross-view object discovery enhances part-level affordance cues.
Abstract
Facilitating an entity's interaction with objects requires accurately identifying parts that afford specific actions. Weakly supervised affordance grounding (WSAG) seeks to imitate human learning from third-person demonstrations, where humans intuitively grasp functional parts without needing pixel-level annotations. To achieve this, grounding is typically learned using a shared classifier across images from different perspectives, along with distillation strategies incorporating part discovery process. However, since affordance-relevant parts are not always easily distinguishable, models primarily rely on classification, often focusing on common class-specific patterns that are unrelated to affordance. To address this limitation, we move beyond isolated part-level learning by introducing selective prototypical and pixel contrastive objectives that adaptively learn affordance-relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
