Changes-Aware Transformer: Learning Generalized Changes Representation
Dan Wang, Licheng Jiao, Jie Chen, Shuyuan Yang, Fang Liu

TL;DR
This paper introduces a Changes-Aware Transformer that learns a generalized representation of changes in bi-temporal images, improving change detection accuracy and generalization across datasets.
Contribution
The paper proposes a novel Changes-Aware Transformer (CAT) that refines difference features using cosine cross-attention, enabling better change detection and compatibility with existing methods.
Findings
Achieves state-of-the-art performance on remote sensing and street scene datasets.
Effectively perceives changed and unchanged pixels in the difference feature space.
Demonstrates excellent generalization across different datasets.
Abstract
Difference features obtained by comparing the images of two periods play an indispensable role in the change detection (CD) task. However, a pair of bi-temporal images can exhibit diverse changes, which may cause various difference features. Identifying changed pixels with differ difference features to be the same category is thus a challenge for CD. Most nowadays' methods acquire distinctive difference features in implicit ways like enhancing image representation or supervision information. Nevertheless, informative image features only guarantee object semantics are modeled and can not guarantee that changed pixels have similar semantics in the difference feature space and are distinct from those unchanged ones. In this work, the generalized representation of various changes is learned straightforwardly in the difference feature space, and a novel Changes-Aware Transformer (CAT) for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Remote Sensing and Land Use
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Linear Layer · Softmax
