Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement
Lian He, Meng Liu, Qilang Ye, Yu Zhou, Xiang Deng, Gangyi Ding

TL;DR
This paper introduces TASA, a geometry-optimized framework that combines 2D semantic cues and 3D geometric reasoning to improve scene-level affordance segmentation for embodied agents, achieving higher accuracy and efficiency.
Contribution
TASA is the first to jointly leverage 2D semantic cues and 3D geometric reasoning in a coarse-to-fine manner for 3D affordance segmentation, addressing previous limitations.
Findings
TASA outperforms baselines in accuracy on SceneFun3D.
TASA improves efficiency in scene-level affordance segmentation.
TASA effectively integrates 2D priors with 3D geometry for coherent masks.
Abstract
Understanding 3D scene-level affordances from natural language instructions is essential for enabling embodied agents to interact meaningfully in complex environments. However, this task remains challenging due to the need for semantic reasoning and spatial grounding. Existing methods mainly focus on object-level affordances or merely lift 2D predictions to 3D, neglecting rich geometric structure information in point clouds and incurring high computational costs. To address these limitations, we introduce Task-Aware 3D Scene-level Affordance segmentation (TASA), a novel geometry-optimized framework that jointly leverages 2D semantic cues and 3D geometric reasoning in a coarse-to-fine manner. To improve the affordance detection efficiency, TASA features a task-aware 2D affordance detection module to identify manipulable points from language and visual inputs, guiding the selection of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Robotic Path Planning Algorithms
