TL;DR
This paper introduces a semantic-aware self-supervised pre-training method for remote sensing change detection, leveraging point-level supervision to learn spatially-sensitive, semantically discriminative features that improve change detection performance.
Contribution
It proposes a novel semantic-aware self-supervised learning framework with point-level supervision for dense remote sensing change detection, outperforming traditional pre-training methods.
Findings
Significantly outperforms ImageNet pre-training and other SSL methods.
Improves model generalization and data efficiency.
Achieves competitive results with only 20% of training data.
Abstract
Supervised deep learning models depend on massive labeled data. Unfortunately, it is time-consuming and labor-intensive to collect and annotate bitemporal samples containing desired changes. Transfer learning from pre-trained models is effective to alleviate label insufficiency in remote sensing (RS) change detection (CD). We explore the use of semantic information during pre-training. Different from traditional supervised pre-training that learns the mapping from image to label, we incorporate semantic supervision into the self-supervised learning (SSL) framework. Typically, multiple objects of interest (e.g., buildings) are distributed in various locations in an uncurated RS image. Instead of manipulating image-level representations via global pooling, we introduce point-level supervision on per-pixel embeddings to learn spatially-sensitive features, thus benefiting downstream dense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
