GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Yansong Qu, Shaohui Dai, Xinyang Li, Jianghang Lin, Liujuan Cao,, Shengchuan Zhang, Rongrong Ji

TL;DR
GOI introduces a novel method for 3D scene understanding that uses an optimizable semantic hyperplane to accurately locate regions based on natural language, improving upon previous threshold-based approaches.
Contribution
The paper presents a new hyperplane-based approach for open-vocabulary 3D region localization, integrating semantic features with 3D Gaussian Splatting and fine-tuning with RES models.
Findings
Outperforms previous state-of-the-art methods in accuracy.
Efficient semantic feature compression using scene priors.
Enhanced precision in 3D region localization with hyperplane tuning.
Abstract
3D open-vocabulary scene understanding, crucial for advancing augmented reality and robotic applications, involves interpreting and locating specific regions within a 3D space as directed by natural language instructions. To this end, we introduce GOI, a framework that integrates semantic features from 2D vision-language foundation models into 3D Gaussian Splatting (3DGS) and identifies 3D Gaussians of Interest using an Optimizable Semantic-space Hyperplane. Our approach includes an efficient compression method that utilizes scene priors to condense noisy high-dimensional semantic features into compact low-dimensional vectors, which are subsequently embedded in 3DGS. During the open-vocabulary querying process, we adopt a distinct approach compared to existing methods, which depend on a manually set fixed empirical threshold to select regions based on their semantic feature distance to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Image Retrieval and Classification Techniques · Data Management and Algorithms
MethodsSparse Evolutionary Training · Feature Selection
