Dual-Tasks Siamese Transformer Framework for Building Damage Assessment
Hongruixuan Chen, Edoardo Nemni, Sofia Vallecorsa, Xi Li, Chen Wu,, Lars Bromley

TL;DR
This paper introduces DamFormer, a novel Transformer-based framework for building damage assessment from multitemporal remote sensing images, outperforming CNN-based methods by modeling long-range pixel relationships.
Contribution
It is the first Transformer-based architecture designed specifically for multitemporal remote sensing damage assessment tasks.
Findings
Demonstrates superior performance on the xBD dataset.
Effectively models non-local relationships in multitemporal images.
Outperforms traditional CNN-based approaches.
Abstract
Accurate and fine-grained information about the extent of damage to buildings is essential for humanitarian relief and disaster response. However, as the most commonly used architecture in remote sensing interpretation tasks, Convolutional Neural Networks (CNNs) have limited ability to model the non-local relationship between pixels. Recently, Transformer architecture first proposed for modeling long-range dependency in natural language processing has shown promising results in computer vision tasks. Considering the frontier advances of Transformer architecture in the computer vision field, in this paper, we present the first attempt at designing a Transformer-based damage assessment architecture (DamFormer). In DamFormer, a siamese Transformer encoder is first constructed to extract non-local and representative deep features from input multitemporal image-pairs. Then, a multitemporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Fire Detection and Safety Systems
MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Label Smoothing · Absolute Position Encodings · Residual Connection
