SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting

Mengjiao Ma; Qi Ma; Yue Li; Jiahuan Cheng; Runyi Yang; Bin Ren; Nikola Popovic; Mingqiang Wei; Nicu Sebe; Luc Van Gool; Theo Gevers; Martin R. Oswald; Danda Pani Paudel

arXiv:2506.08710·cs.CV·January 27, 2026

SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting

Mengjiao Ma, Qi Ma, Yue Li, Jiahuan Cheng, Runyi Yang, Bin Ren, Nikola Popovic, Mingqiang Wei, Nicu Sebe, Luc Van Gool, Theo Gevers, Martin R. Oswald, Danda Pani Paudel

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper introduces SceneSplat++, a large-scale benchmark and dataset for evaluating language Gaussian Splatting methods in 3D scene understanding, highlighting the advantages of the generalizable approach over existing methods.

Contribution

It provides the first comprehensive benchmark for three groups of language Gaussian Splatting methods in 3D space, along with a new large dataset for evaluation.

Findings

01

Generalizable methods outperform others in 3D segmentation.

02

Benchmark reveals the efficiency of the generalizable paradigm.

03

GaussianWorld-49K dataset supports diverse scene understanding.

Abstract

3D Gaussian Splatting (3DGS) serves as a highly performant and efficient encoding of scene geometry, appearance, and semantics. Moreover, grounding language in 3D scenes has proven to be an effective strategy for 3D scene understanding. Current Language Gaussian Splatting line of work fall into three main groups: (i) per-scene optimization-based, (ii) per-scene optimization-free, and (iii) generalizable approach. However, most of them are evaluated only on rendered 2D views of a handful of scenes and viewpoints close to the training views, limiting ability and insight into holistic 3D understanding. To address this gap, we propose the first large-scale benchmark that systematically assesses these three groups of methods directly in 3D space, evaluating on 1060 scenes across three indoor datasets and one outdoor dataset. Benchmark results demonstrate a clear advantage of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

GaussianWorld/scene_splat_49k
dataset· 58 dl
58 dl

Videos

SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting· slideslive

Taxonomy

Topics3D Shape Modeling and Analysis · Face recognition and analysis · Robotics and Sensor-Based Localization