Multi-Receptive Field Ensemble with Cross-Entropy Masking for Class Imbalance in Remote Sensing Change Detection
Humza Naveed, Xina Zeng, Mitch Bryson, Nagita Mehrseresht

TL;DR
This paper introduces a novel multi-receptive field ensemble architecture with cross-entropy masking for remote sensing change detection, effectively capturing multi-scale change patterns and addressing class imbalance, leading to improved performance over state-of-the-art methods.
Contribution
It proposes a new architecture combining SAM-based features, multi-receptive field ensemble, and a cross-entropy masking loss for better change detection in remote sensing images.
Findings
Outperforms state-of-the-art methods on four datasets.
Achieves 2.97% F1-score improvement on S2Looking.
Effectively handles class imbalance with CEM loss.
Abstract
Remote sensing change detection (RSCD) is a complex task, where changes often appear at different scales and orientations. Convolutional neural networks (CNNs) are good at capturing local spatial patterns but cannot model global semantics due to limited receptive fields. Alternatively, transformers can model long-range dependencies but are data hungry, and RSCD datasets are not large enough to train these models effectively. To tackle this, this paper presents a new architecture for RSCD which adapts a segment anything (SAM) vision foundation model and processes features from the SAM encoder through a multi-receptive field ensemble to capture local and global change patterns. We propose an ensemble of spatial-temporal feature enhancement (STFE) to capture cross-temporal relations, a decoder to reconstruct change patterns, and a multi-scale decoder fusion with attention (MSDFA) to fuse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
