TL;DR
This paper introduces STSF-Net, a novel framework for multimodal change detection combining optical and SAR images, utilizing modality-specific and common features with semantic-guided fusion, and provides a new benchmark dataset.
Contribution
The paper proposes a new multimodal change detection framework with adaptive feature fusion and introduces the first multiclass MMCD dataset with high-resolution optical and SAR images.
Findings
Outperforms state-of-the-art methods on multiple datasets.
Achieves 3.21%, 1.08%, and 1.32% improvements in mIoU.
Demonstrates effective modeling of modality-specific and common features.
Abstract
Multimodal change detection (MMCD) identifies changed areas in multimodal remote sensing (RS) data, demonstrating significant application value in land use monitoring, disaster assessment, and urban sustainable development. However, literature MMCD approaches exhibit limitations in cross-modal interaction and exploiting modality-specific characteristics. This leads to insufficient modeling of fine-grained change information, thus hindering the precise detection of semantic changes in multimodal data. To address the above problems, we propose STSF-Net, a framework designed for MMCD between optical and SAR images. STSF-Net jointly models modality-specific and spatio-temporal common features to enhance change representations. Specifically, modality-specific features are exploited to capture genuine semantic change signals, while spatio-temporal common features are embedded to suppress…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
