Feature Aggregation Network for Building Extraction from High-resolution Remote Sensing Images
Xuan Zhou, Xuefeng Wei

TL;DR
This paper introduces FANet, a novel neural network architecture that effectively combines global and local features for precise building extraction from high-resolution satellite imagery, addressing the challenge of intraclass variability.
Contribution
The paper presents FANet, which integrates Pyramid Vision Transformer, Feature Aggregation, Difference Elimination, Receptive Field Block, and Dual Attention modules for enhanced feature extraction in remote sensing images.
Findings
FANet outperforms existing methods on multiple datasets.
The model achieves high accuracy in boundary recognition.
Extensive experiments validate the effectiveness of the proposed architecture.
Abstract
The rapid advancement in high-resolution satellite remote sensing data acquisition, particularly those achieving submeter precision, has uncovered the potential for detailed extraction of surface architectural features. However, the diversity and complexity of surface distributions frequently lead to current methods focusing exclusively on localized information of surface features. This often results in significant intraclass variability in boundary recognition and between buildings. Therefore, the task of fine-grained extraction of surface features from high-resolution satellite imagery has emerged as a critical challenge in remote sensing image processing. In this work, we propose the Feature Aggregation Network (FANet), concentrating on extracting both global and local features, thereby enabling the refined extraction of landmark buildings from high-resolution satellite remote…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Remote-Sensing Image Classification · Remote Sensing and LiDAR Applications
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer
