TL;DR
This paper introduces a flexible, foundation model-based framework for semantic change detection in remote sensing imagery, achieving state-of-the-art results and high data efficiency.
Contribution
It proposes a modular cascaded decoder and a novel loss function to enhance semantic change detection across diverse models and conditions.
Findings
Achieves new state-of-the-art Sek scores on SECOND and LandsatSCD datasets.
Demonstrates high data efficiency, outperforming full-data baselines with only 50% data.
Shows robustness and generalization across different backbones and radiometric variations.
Abstract
Remote sensing (RS) change detection is essential for interpreting surface dynamics. Semantic change detection (SCD) further enables pixel-level understanding of multi-class transitions, yet remains sensitive to pseudo-changes induced by imaging conditions. Recent RS foundation models extract semantically consistent features across temporal and environmental variations, which is critical for mitigating pseudo-changes. However, existing SCD methods are often rigid and backbone-specific, lacking the flexibility to integrate diverse multi-scale features from emerging foundation models. To this end, we introduce a modular Cascaded Gated Decoder (CG-Decoder) that bridges various backbones and SCD tasks, processing multi-scale features in a coarse-to-fine manner while enabling adaptive change extraction. Building upon the RS foundation model PerA, we present PerASCD, a unified SCD framework.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
