Grounding 3D Scene Affordance From Egocentric Interactions
Cuiyu Liu, Wei Zhai, Yuhang Yang, Hongchen Luo, Sen Liang, Yang Cao,, and Zheng-Jun Zha

TL;DR
This paper introduces a novel task of grounding 3D scene affordance from egocentric videos, enabling agents to actively identify interaction-relevant regions in 3D environments, supported by a new dataset and a specialized framework.
Contribution
It proposes the Ego-SAG framework for egocentric interaction-driven 3D affordance grounding and introduces the VSAD dataset for training and evaluation.
Findings
Ego-SAG effectively identifies interaction-relevant regions in 3D scenes.
The VSAD dataset covers diverse interactions and environments.
Experiments show the approach outperforms baselines in accuracy.
Abstract
Grounding 3D scene affordance aims to locate interactive regions in 3D environments, which is crucial for embodied agents to interact intelligently with their surroundings. Most existing approaches achieve this by mapping semantics to 3D instances based on static geometric structure and visual appearance. This passive strategy limits the agent's ability to actively perceive and engage with the environment, making it reliant on predefined semantic instructions. In contrast, humans develop complex interaction skills by observing and imitating how others interact with their surroundings. To empower the model with such abilities, we introduce a novel task: grounding 3D scene affordance from egocentric interactions, where the goal is to identify the corresponding affordance regions in a 3D scene based on an egocentric video of an interaction. This task faces the challenges of spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Advanced Vision and Imaging · 3D Shape Modeling and Analysis
