Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled   Feature Fields

Shijie Zhou; Haoran Chang; Sicheng Jiang; Zhiwen Fan; Zehao Zhu; Dejia; Xu; Pradyumna Chari; Suya You; Zhangyang Wang; Achuta Kadambi

arXiv:2312.03203·cs.CV·April 9, 2024·2 cites

Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

Shijie Zhou, Haoran Chang, Sicheng Jiang, Zhiwen Fan, Zehao Zhu, Dejia, Xu, Pradyumna Chari, Suya You, Zhangyang Wang, Achuta Kadambi

PDF

Open Access 2 Repos

TL;DR

This paper introduces Feature 3DGS, a method that extends 3D Gaussian Splatting to incorporate semantic features from 2D foundation models, enabling real-time, semantically aware 3D scene editing and segmentation with improved speed and quality.

Contribution

It proposes architectural and training innovations to integrate semantic feature fields into 3D Gaussian Splatting, facilitating fast, high-quality semantic rendering and manipulation.

Findings

01

Enables real-time semantic segmentation and editing in 3D scenes.

02

Achieves comparable or better results than existing methods.

03

First to incorporate point and bounding-box prompts for radiance field manipulation.

Abstract

3D scene representations have gained immense popularity in recent years. Methods that use Neural Radiance fields are versatile for traditional tasks such as novel view synthesis. In recent times, some work has emerged that aims to extend the functionality of NeRF beyond view synthesis, for semantically aware tasks such as editing and segmentation using 3D feature field distillation from 2D foundation models. However, these methods have two major limitations: (a) they are limited by the rendering speed of NeRF pipelines, and (b) implicitly represented feature fields suffer from continuity artifacts reducing feature quality. Recently, 3D Gaussian Splatting has shown state-of-the-art performance on real-time radiance field rendering. In this work, we go one step further: in addition to radiance field rendering, we enable 3D Gaussian splatting on arbitrary-dimension semantic features via 2D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Industrial Vision Systems and Defect Detection

MethodsSegment Anything Model · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Attentive Walk-Aggregating Graph Neural Network