Closed-Loop Transfer for Weakly-supervised Affordance Grounding

Jiajin Tang; Zhengxuan Wei; Ge Zheng; Sibei Yang

arXiv:2510.17384·cs.CV·October 21, 2025

Closed-Loop Transfer for Weakly-supervised Affordance Grounding

Jiajin Tang, Zhengxuan Wei, Ge Zheng, Sibei Yang

PDF

Open Access

TL;DR

This paper presents LoopTrans, a closed-loop framework for weakly-supervised affordance grounding that improves knowledge transfer between exocentric and egocentric images, enabling better localization of interaction regions even in occluded scenarios.

Contribution

Introduction of LoopTrans, a novel closed-loop transfer framework with cross-modal localization and denoising distillation for enhanced affordance grounding.

Findings

01

Achieves consistent improvements across all metrics on benchmarks.

02

Effectively handles occluded interaction regions.

03

Enhances knowledge extraction through bidirectional transfer.

Abstract

Humans can perform previously unexperienced interactions with novel objects simply by observing others engage with them. Weakly-supervised affordance grounding mimics this process by learning to locate object regions that enable actions on egocentric images, using exocentric interaction images with image-level annotations. However, extracting affordance knowledge solely from exocentric images and transferring it one-way to egocentric images limits the applicability of previous works in complex interaction scenarios. Instead, this study introduces LoopTrans, a novel closed-loop framework that not only transfers knowledge from exocentric to egocentric but also transfers back to enhance exocentric knowledge extraction. Within LoopTrans, several innovative mechanisms are introduced, including unified cross-modal localization and denoising knowledge distillation, to bridge domain gaps…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Robot Manipulation and Learning