Prior-guided Fusion of Multimodal Features for Change Detection from Optical-SAR Images

Xuanguang Liu; Lei Ding; Yujie Li; Chenguang Dai; Zhenchao Zhang; Mengmeng Li; Ziyi Yang; Yifan Sun; Yongqi Sun; Hanyun Wang

arXiv:2604.05527·cs.CV·April 8, 2026

Prior-guided Fusion of Multimodal Features for Change Detection from Optical-SAR Images

Xuanguang Liu, Lei Ding, Yujie Li, Chenguang Dai, Zhenchao Zhang, Mengmeng Li, Ziyi Yang, Yifan Sun, Yongqi Sun, Hanyun Wang

PDF

1 Repo

TL;DR

This paper introduces STSF-Net, a novel framework for multimodal change detection combining optical and SAR images, utilizing modality-specific and common features with semantic-guided fusion, and provides a new benchmark dataset.

Contribution

The paper proposes a new multimodal change detection framework with adaptive feature fusion and introduces the first multiclass MMCD dataset with high-resolution optical and SAR images.

Findings

01

Outperforms state-of-the-art methods on multiple datasets.

02

Achieves 3.21%, 1.08%, and 1.32% improvements in mIoU.

03

Demonstrates effective modeling of modality-specific and common features.

Abstract

Multimodal change detection (MMCD) identifies changed areas in multimodal remote sensing (RS) data, demonstrating significant application value in land use monitoring, disaster assessment, and urban sustainable development. However, literature MMCD approaches exhibit limitations in cross-modal interaction and exploiting modality-specific characteristics. This leads to insufficient modeling of fine-grained change information, thus hindering the precise detection of semantic changes in multimodal data. To address the above problems, we propose STSF-Net, a framework designed for MMCD between optical and SAR images. STSF-Net jointly models modality-specific and spatio-temporal common features to enhance change representations. Specifically, modality-specific features are exploited to capture genuine semantic change signals, while spatio-temporal common features are embedded to suppress…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuxuanguang/STSF-Net
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.