HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting
Hongyu Zhou, Jiahao Shao, Lu Xu, Dongfeng Bai, Weichao Qiu, Bingbing, Liu, Yue Wang, Andreas Geiger, Yiyi Liao

TL;DR
This paper presents a novel 3D Gaussian Splatting pipeline for comprehensive urban scene understanding from RGB images, enabling real-time rendering, semantic parsing, and dynamic scene reconstruction without relying on additional sensors.
Contribution
It introduces a joint optimization method for geometry, appearance, semantics, and motion using static and dynamic 3D Gaussians, improving holistic urban scene understanding.
Findings
Achieves real-time rendering of novel viewpoints.
Provides high-accuracy semantic and motion information.
Effectively reconstructs dynamic scenes with noisy 3D detections.
Abstract
Holistic understanding of urban scenes based on RGB images is a challenging yet important problem. It encompasses understanding both the geometry and appearance to enable novel view synthesis, parsing semantic labels, and tracking moving objects. Despite considerable progress, existing approaches often focus on specific aspects of this task and require additional inputs such as LiDAR scans or manually annotated 3D bounding boxes. In this paper, we introduce a novel pipeline that utilizes 3D Gaussian Splatting for holistic urban scene understanding. Our main idea involves the joint optimization of geometry, appearance, semantics, and motion using a combination of static and dynamic 3D Gaussians, where moving object poses are regularized via physical constraints. Our approach offers the ability to render new viewpoints in real-time, yielding 2D and 3D semantic information with high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Remote Sensing and LiDAR Applications · Automated Road and Building Extraction
MethodsFocus
