Feature-Augmented Deep Networks for Multiscale Building Segmentation in High-Resolution UAV and Satellite Imagery
Chintan B. Maniyar, Minakshi Kumar, Gengchen Mai

TL;DR
This paper introduces a deep learning framework that combines multi-scale RGB imagery, feature augmentation, and advanced training strategies to improve building segmentation accuracy in high-resolution aerial and satellite images.
Contribution
It presents a novel feature-augmented Res-U-Net model with optimized training policies for multiscale building segmentation from diverse remote sensing data.
Findings
Achieved 96.5% accuracy on a challenging dataset
Outperformed existing RGB-based segmentation benchmarks
Demonstrated robustness across different spatial resolutions
Abstract
Accurate building segmentation from high-resolution RGB imagery remains challenging due to spectral similarity with non-building features, shadows, and irregular building geometries. In this study, we present a comprehensive deep learning framework for multiscale building segmentation using RGB aerial and satellite imagery with spatial resolutions ranging from 0.4m to 2.7m. We curate a diverse, multi-sensor dataset and introduce feature-augmented inputs by deriving secondary representations including Principal Component Analysis (PCA), Visible Difference Vegetation Index (VDVI), Morphological Building Index (MBI), and Sobel edge filters from RGB channels. These features guide a Res-U-Net architecture in learning complex spatial patterns more effectively. We also propose training policies incorporating layer freezing, cyclical learning rates, and SuperConvergence to reduce training time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Neural Network Applications · Automated Road and Building Extraction
