3D Instance Segmentation of MVS Buildings
Jiazhou Chen, Yanghui Xu, Shufang Lu, Ronghua Liang, and Liangliang, Nan

TL;DR
This paper introduces a new 3D instance segmentation framework for urban buildings using multi-view images, effectively handling attached and embedded structures, and provides a novel annotated dataset for outdoor 3D segmentation.
Contribution
The work presents a novel multi-view 3D instance segmentation method for urban buildings and introduces the first outdoor dataset with detailed 3D building annotations.
Findings
Effective segmentation of attached 3D buildings in urban scenes.
Multi-view framework outperforms orthophoto-based methods.
New dataset enhances research in 3D urban scene analysis.
Abstract
We present a novel 3D instance segmentation framework for Multi-View Stereo (MVS) buildings in urban scenes. Unlike existing works focusing on semantic segmentation of urban scenes, the emphasis of this work lies in detecting and segmenting 3D building instances even if they are attached and embedded in a large and imprecise 3D surface model. Multi-view RGB images are first enhanced to RGBH images by adding a heightmap and are segmented to obtain all roof instances using a fine-tuned 2D instance segmentation neural network. Instance masks from different multi-view images are then clustered into global masks. Our mask clustering accounts for spatial occlusion and overlapping, which can eliminate segmentation ambiguities among multi-view images. Based on these global masks, 3D roof instances are segmented out by mask back-projections and extended to the entire building instances through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
