UAGLNet: Uncertainty-Aggregated Global-Local Fusion Network with Cooperative CNN-Transformer for Building Extraction
Siyuan Yao, Dongxiu Liu, Taotao Li, Shengjie Li, Wenqi Ren, Xiaochun Cao

TL;DR
UAGLNet is a novel neural network that combines CNN and transformer architectures with uncertainty modeling to improve building extraction accuracy from remote sensing images by effectively integrating global and local features.
Contribution
The paper introduces a hybrid CNN-transformer encoder with a cooperative interaction block and a global-local fusion module, along with an uncertainty-aggregated decoder for enhanced segmentation.
Findings
Achieves superior performance over state-of-the-art methods.
Effectively reduces segmentation ambiguity in uncertain regions.
Demonstrates robustness across diverse remote sensing datasets.
Abstract
Building extraction from remote sensing images is a challenging task due to the complex structure variations of the buildings. Existing methods employ convolutional or self-attention blocks to capture the multi-scale features in the segmentation models, while the inherent gap of the feature pyramids and insufficient global-local feature integration leads to inaccurate, ambiguous extraction results. To address this issue, in this paper, we present an Uncertainty-Aggregated Global-Local Fusion Network (UAGLNet), which is capable to exploit high-quality global-local visual semantics under the guidance of uncertainty modeling. Specifically, we propose a novel cooperative encoder, which adopts hybrid CNN and transformer layers at different stages to capture the local and global visual semantics, respectively. An intermediate cooperative interaction block (CIB) is designed to narrow the gap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Remote Sensing and LiDAR Applications
