NeXt2Former-CD: Efficient Remote Sensing Change Detection with Modern Vision Architectures
Yufan Wang, Sokratis Makrogiannis, Chandra Kambhamettu

TL;DR
NeXt2Former-CD introduces a modern vision architecture for remote sensing change detection, outperforming existing methods in accuracy while maintaining practical inference speed, and effectively handling noise and small object shifts.
Contribution
The paper presents NeXt2Former-CD, a novel end-to-end framework combining convolutional and attention-based modules for improved change detection in remote sensing imagery.
Findings
Achieves state-of-the-art results on multiple datasets.
Outperforms recent baselines in F1 score and IoU.
Maintains inference latency comparable to traditional methods.
Abstract
State Space Models (SSMs) have recently gained traction in remote sensing change detection (CD) for their favorable scaling properties. In this paper, we explore the potential of modern convolutional and attention-based architectures as a competitive alternative. We propose NeXt2Former-CD, an end-to-end framework that integrates a Siamese ConvNeXt encoder initialized with DINOv3 weights, a deformable attention-based temporal fusion module, and a Mask2Former decoder. This design is intended to better tolerate residual co-registration noise and small object-level spatial shifts, as well as semantic ambiguity in bi-temporal imagery. Experiments on LEVIR-CD, WHU-CD, and CDD datasets show that our method achieves the best results among the evaluated methods, improving over recent Mamba-based baselines in both F1 score and IoU. Furthermore, despite a larger parameter count, our model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Remote Sensing in Agriculture · Advanced Neural Network Applications
