SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection
Dalong Zheng, Zebin Wu, Jia Liu, Zhihui Wei

TL;DR
SwinV2DNet is an innovative change detection network combining transformer and CNN features with a novel pyramid and self-supervision strategy, achieving state-of-the-art results on remote sensing datasets.
Contribution
The paper introduces SwinV2DNet, a novel end-to-end network that integrates transformer and CNN features with a mixed feature pyramid and self-supervision for improved change detection.
Findings
Achieved state-of-the-art change detection scores.
Provided fine-grained change maps.
Validated effectiveness of MFP module in other networks.
Abstract
Among the current mainstream change detection networks, transformer is deficient in the ability to capture accurate low-level details, while convolutional neural network (CNN) is wanting in the capacity to understand global information and establish remote spatial relationships. Meanwhile, both of the widely used early fusion and late fusion frameworks are not able to well learn complete change features. Therefore, based on swin transformer V2 (Swin V2) and VGG16, we propose an end-to-end compounded dense network SwinV2DNet to inherit the advantages of both transformer and CNN and overcome the shortcomings of existing networks in feature learning. Firstly, it captures the change relationship features through the densely connected Swin V2 backbone, and provides the low-level pre-changed and post-changed features through a CNN branch. Based on these three change features, we accomplish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Remote Sensing and Land Use
MethodsMulti-Head Attention · Attention Is All You Need · Stochastic Depth · Linear Layer · Layer Normalization · Dense Connections · Residual Connection · Softmax · Swin Transformer
