Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation

Rohan Chacko; Nicolai Haeni; Eldar Khaliullin; Lin Sun; Douglas Lee

arXiv:2502.00173·cs.CV·February 4, 2025

Open Access

TL;DR

Lifting By Gaussians (LBG) is a fast, flexible, and scene-agnostic method for 3D instance segmentation that leverages 2D segmentation masks and features to directly segment 3D Gaussian radiance fields without per-scene training.

Contribution

LBG is a new approach to 3D instance segmentation that requires no per-scene training and is faster, simpler, and more modular than previous methods, enabling high-quality semantic segmentation of existing 3DGS fields.

Findings

01. LBG operates seamlessly on existing 3DGS reconstructions without per-scene training.

02. LBG achieves superior semantic segmentation for novel view synthesis and asset extraction.

03. LBG is an order of magnitude faster and simpler than previous approaches.

Abstract

We introduce Lifting By Gaussians (LBG), a novel approach for open-world instance segmentation of 3D Gaussian Splatted Radiance Fields (3DGS). Recently, 3DGS Fields have emerged as a highly efficient and explicit alternative to Neural Field-based methods for high-quality Novel View Synthesis. Our 3D instance segmentation method directly lifts 2D segmentation masks from SAM (alternately FastSAM, etc.), together with features from CLIP and DINOv2, directly fusing them onto 3DGS (or similar Gaussian radiance fields such as 2DGS). Unlike previous approaches, LBG requires no per-scene training, allowing it to operate seamlessly on any existing 3DGS reconstruction. Our approach is not only an order of magnitude faster and simpler than existing approaches; it is also highly modular, enabling 3D semantic segmentation of existing 3DGS fields without requiring a specific parametrization of the 3D…
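The lifting idea in the abstract, projecting 2D masks onto the Gaussians that render each pixel, can be illustrated with a toy weighted-voting sketch. This is not the authors' implementation: it assumes a rasterizer that reports, for every pixel, which Gaussians contributed and with what alpha-blending weight, and it assumes the 2D mask IDs are already consistent across views (in practice, masks from SAM would first need cross-view association, for example using the CLIP and DINOv2 features). The function name `lift_masks_to_gaussians` and the data layout are hypothetical.

```python
from collections import defaultdict


def lift_masks_to_gaussians(views, num_gaussians, background=-1):
    """Label each Gaussian with the mask ID that received the most blending weight.

    views: list of (mask, contributions) pairs, where
      mask[pixel] is a 2D instance ID (assumed consistent across views), and
      contributions[pixel] is a list of (gaussian_index, weight) pairs, i.e.
      the alpha-blending weights a rasterizer would report for that pixel.
    Both structures are hypothetical stand-ins for real rasterizer output.
    """
    # One vote table per Gaussian: instance ID -> accumulated weight.
    votes = [defaultdict(float) for _ in range(num_gaussians)]

    for mask, contributions in views:
        for pixel, instance_id in mask.items():
            if instance_id == background:
                continue  # unlabeled pixels cast no votes
            for gaussian_index, weight in contributions.get(pixel, []):
                votes[gaussian_index][instance_id] += weight

    labels = []
    for per_gaussian in votes:
        if per_gaussian:
            # Pick the instance ID with the largest accumulated weight.
            labels.append(max(per_gaussian, key=per_gaussian.get))
        else:
            labels.append(background)  # never seen under any mask
    return labels


# Two tiny synthetic views over three Gaussians (all values made up).
view_a = (
    {(0, 0): 1, (0, 1): 1, (1, 0): 2, (1, 1): -1},
    {
        (0, 0): [(0, 0.9), (1, 0.1)],
        (0, 1): [(0, 0.7)],
        (1, 0): [(1, 0.8)],
        (1, 1): [(2, 1.0)],
    },
)
view_b = (
    {(0, 0): 2, (0, 1): 1},
    {(0, 0): [(1, 0.6)], (0, 1): [(0, 0.5), (1, 0.2)]},
)

labels = lift_masks_to_gaussians([view_a, view_b], num_gaussians=3)
print(labels)  # [1, 2, -1]
```

Because the labels are computed in a single pass over rendered views, a scheme of this kind needs no per-scene optimization, which is consistent with the training-free property the abstract emphasizes.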

Taxonomy

Topics: 3D Shape Modeling and Analysis · Image Processing and 3D Reconstruction · Industrial Vision Systems and Defect Detection

Methods: Contrastive Language-Image Pre-training · Segment Anything Model