Vision-Only Gaussian Splatting for Collaborative Semantic Occupancy Prediction

Cheng Chen; Hao Huang; Saurabh Bagchi

arXiv:2508.10936 · cs.CV · November 25, 2025

TL;DR

This paper introduces a vision-only method for collaborative 3D semantic occupancy prediction based on sparse Gaussian splatting. It lowers communication cost and improves accuracy relative to existing methods that share dense 3D voxels or 2D planar features.

Contribution

It is the first to leverage sparse 3D semantic Gaussian splatting for collaborative perception, enabling accurate and communication-efficient 3D semantic occupancy prediction.

Findings

01. Outperforms single-agent perception by +8.42 mIoU

02. Outperforms baseline collaborative methods by +3.28 mIoU

03. Maintains robust performance with only 34.6% communication volume
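The gains above are reported in mIoU (mean intersection-over-union), the per-class IoU averaged over semantic classes. As a minimal sketch of how that metric is computed over voxel labels: the function name `mean_iou`, the flat label lists, and the rule of skipping classes absent from both prediction and ground truth are assumptions made for illustration, and the paper's evaluation protocol may differ (for example in how empty voxels are treated).

```python
def mean_iou(pred, target, num_classes):
    """Average per-class IoU over classes that appear in pred or target.

    pred and target are equal-length sequences of integer class labels,
    one per voxel (hypothetical flat layout for illustration).
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both (an assumption)
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with six voxels and three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
# Per-class IoU: class 0 = 1/3, class 1 = 2/3, class 2 = 1/2, so the mean is 0.5
print(round(score, 3))
```

mIoU is usually quoted as a percentage, so a figure such as +8.42 mIoU is a difference in percentage points between two such scores.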

Abstract

Collaborative perception enables connected vehicles to share information, overcoming occlusions and extending the limited sensing range inherent in single-agent (non-collaborative) systems. Existing vision-only methods for 3D semantic occupancy prediction commonly rely on dense 3D voxels, which incur high communication costs, or 2D planar features, which require accurate depth estimation or additional supervision, limiting their applicability to collaborative scenarios. To address these challenges, we propose the first approach leveraging sparse 3D semantic Gaussian splatting for collaborative 3D semantic occupancy prediction. By sharing and fusing intermediate Gaussian primitives, our method provides three benefits: a neighborhood-based cross-agent fusion that removes duplicates and suppresses noisy or inconsistent Gaussians; a joint encoding of geometry and semantics in each…
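To make the idea of neighborhood-based cross-agent fusion concrete, the following is a minimal sketch, not the authors' implementation. It assumes each Gaussian carries a 3D mean already transformed into a shared world frame, an opacity, and a semantic label; the `Gaussian` class, the `fuse` function, and the `radius` and `min_opacity` thresholds are all hypothetical. It applies only two simple rules: drop received Gaussians with very low opacity (treated as noise), and drop received Gaussians that have a same-label ego Gaussian nearby (treated as duplicates).

```python
import math
from dataclasses import dataclass


@dataclass
class Gaussian:
    mean: tuple      # (x, y, z) in a shared world frame (assumed)
    opacity: float   # confidence-like weight in [0, 1]
    label: int       # semantic class index


def fuse(ego, received, radius=0.5, min_opacity=0.1):
    """Merge received Gaussians into the ego set using two illustrative rules."""
    fused = list(ego)
    for g in received:
        if g.opacity < min_opacity:
            continue  # suppress low-opacity primitives as likely noise
        is_duplicate = any(
            e.label == g.label and math.dist(e.mean, g.mean) <= radius
            for e in ego
        )
        if is_duplicate:
            continue  # a same-label ego Gaussian already covers this spot
        fused.append(g)
    return fused


ego = [Gaussian((0.0, 0.0, 0.0), 0.9, 1)]
received = [
    Gaussian((0.1, 0.0, 0.0), 0.8, 1),   # near-duplicate of the ego Gaussian
    Gaussian((5.0, 0.0, 0.0), 0.7, 2),   # new region, kept
    Gaussian((6.0, 0.0, 0.0), 0.01, 2),  # low opacity, suppressed
]
fused = fuse(ego, received)
print(len(fused))
```

Because only sparse primitives are exchanged rather than a dense voxel grid, the number of transmitted Gaussians directly controls the communication volume; a real system would also need a spatial index rather than the brute-force neighbor search used here.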

Taxonomy

Topics: Robotics and Sensor-Based Localization · Advanced Neural Network Applications · Advanced Vision and Imaging