Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
Shijie Zhou, Haoran Chang, Sicheng Jiang, Zhiwen Fan, Zehao Zhu, Dejia, Xu, Pradyumna Chari, Suya You, Zhangyang Wang, Achuta Kadambi

TL;DR
This paper introduces Feature 3DGS, a method that extends 3D Gaussian Splatting to incorporate semantic features from 2D foundation models, enabling real-time, semantically aware 3D scene editing and segmentation with improved speed and quality.
Contribution
It proposes architectural and training innovations to integrate semantic feature fields into 3D Gaussian Splatting, facilitating fast, high-quality semantic rendering and manipulation.
Findings
Enables real-time semantic segmentation and editing in 3D scenes.
Achieves comparable or better results than existing methods.
First to incorporate point and bounding-box prompts for radiance field manipulation.
Abstract
3D scene representations have gained immense popularity in recent years. Methods that use Neural Radiance fields are versatile for traditional tasks such as novel view synthesis. In recent times, some work has emerged that aims to extend the functionality of NeRF beyond view synthesis, for semantically aware tasks such as editing and segmentation using 3D feature field distillation from 2D foundation models. However, these methods have two major limitations: (a) they are limited by the rendering speed of NeRF pipelines, and (b) implicitly represented feature fields suffer from continuity artifacts reducing feature quality. Recently, 3D Gaussian Splatting has shown state-of-the-art performance on real-time radiance field rendering. In this work, we go one step further: in addition to radiance field rendering, we enable 3D Gaussian splatting on arbitrary-dimension semantic features via 2D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Industrial Vision Systems and Defect Detection
MethodsSegment Anything Model · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Attentive Walk-Aggregating Graph Neural Network
