Synthetic Data Matters: Re-training with Geo-typical Synthetic Labels for Building Detection
Shuang Song, Yang Tang, Rongjun Qin

TL;DR
This paper introduces a method for improving building segmentation in remote sensing by re-training models with geo-typical synthetic data tailored to specific regions, significantly enhancing generalization without extensive real annotations.
Contribution
The paper presents a novel approach that generates geo-typical synthetic data and integrates it into an adversarial domain adaptation framework for better building segmentation.
Findings
Median performance improvements of up to 12%
Synthetic data closely mimics target urban structures
Effective domain gap reduction through adversarial adaptation
Abstract
Deep learning has significantly advanced building segmentation in remote sensing, yet models struggle to generalize on data of diverse geographic regions due to variations in city layouts and the distribution of building types, sizes and locations. However, the amount of time-consuming annotated data for capturing worldwide diversity may never catch up with the demands of increasingly data-hungry models. Thus, we propose a novel approach: re-training models at test time using synthetic data tailored to the target region's city layout. This method generates geo-typical synthetic data that closely replicates the urban structure of a target area by leveraging geospatial data such as street network from OpenStreetMap. Using procedural modeling and physics-based rendering, very high-resolution synthetic images are created, incorporating domain randomization in building shapes, materials, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Geographic Information Systems Studies
