3D-GRES: Generalized 3D Referring Expression Segmentation
Changli Wu, Yihang Liu, Jiayi Ji, Yiwei Ma, Haowei Wang, Gen Luo,, Henghui Ding, Xiaoshuai Sun, Rongrong Ji

TL;DR
This paper introduces 3D-GRES, a novel task for segmenting multiple objects in 3D space based on natural language, along with a new model MDIN and a dedicated dataset Multi3DRes, advancing multi-object 3D scene understanding.
Contribution
The paper presents the first generalized framework for multi-object 3D referring expression segmentation, including a new model and dataset to handle complex multi-object scenarios.
Findings
MDIN significantly outperforms existing models on Multi3DRes.
The proposed approach effectively segments multiple 3D objects based on natural language.
The new dataset enables comprehensive evaluation of multi-object 3D segmentation methods.
Abstract
3D Referring Expression Segmentation (3D-RES) is dedicated to segmenting a specific instance within a 3D space based on a natural language description. However, current approaches are limited to segmenting a single target, restricting the versatility of the task. To overcome this limitation, we introduce Generalized 3D Referring Expression Segmentation (3D-GRES), which extends the capability to segment any number of instances based on natural language instructions. In addressing this broader task, we propose the Multi-Query Decoupled Interaction Network (MDIN), designed to break down multi-object segmentation tasks into simpler, individual segmentations. MDIN comprises two fundamental components: Text-driven Sparse Queries (TSQ) and Multi-object Decoupling Optimization (MDO). TSQ generates sparse point cloud features distributed over key targets as the initialization for queries.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Advanced Image and Video Retrieval Techniques · Machine Learning and Data Classification
