Indoor Asset Detection in Large Scale 360{\deg} Drone-Captured Imagery via 3D Gaussian Splatting

Monica Tang; Avideh Zakhor

arXiv:2604.05316·cs.CV·April 8, 2026

Indoor Asset Detection in Large Scale 360{\deg} Drone-Captured Imagery via 3D Gaussian Splatting

Monica Tang, Avideh Zakhor

PDF

TL;DR

This paper introduces a novel 3D Gaussian Splatting-based method for indoor asset detection and segmentation from drone imagery, leveraging a 3D object codebook and multi-view mask aggregation.

Contribution

It presents a new approach combining 3D Gaussian Splatting with mask semantics for improved multi-view indoor asset detection and segmentation.

Findings

01

Achieved a 65% F1 score improvement over baselines.

02

Improved object detection mAP by 11% over existing methods.

03

Demonstrated reliable multi-view mask consistency in large indoor scenes.

Abstract

We present an approach for object-level detection and segmentation of target indoor assets in 3D Gaussian Splatting (3DGS) scenes, reconstructed from 360{\deg} drone-captured imagery. We introduce a 3D object codebook that jointly leverages mask semantics and spatial information of their corresponding Gaussian primitives to guide multi-view mask association and indoor asset detection. By integrating 2D object detection and segmentation models with semantically and spatially constrained merging procedures, our method aggregates masks from multiple views into coherent 3D object instances. Experiments on two large indoor scenes demonstrate reliable multi-view mask consistency, improving F1 score by 65% over state-of-the-art baselines, and accurate object-level 3D indoor asset detection, achieving an 11% mAP gain over baseline methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.