RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation
Jiayi Song, Kaiyu Li, Xiangyong Cao, Deyu Meng

TL;DR
This paper introduces RS-MTDF, a novel semi-supervised remote sensing segmentation framework that leverages multiple pre-trained vision foundation models as teachers to improve generalization and semantic accuracy.
Contribution
The paper proposes a multi-teacher distillation and fusion framework utilizing VFMs like DINOv2 and CLIP to guide semi-supervised learning in remote sensing, achieving state-of-the-art results.
Findings
Outperforms existing methods across various label ratios.
Achieves highest IoU in most semantic categories.
Effectively enhances generalization and semantic understanding.
Abstract
Semantic segmentation in remote sensing images is crucial for various applications, yet its performance is heavily reliant on large-scale, high-quality pixel-wise annotations, which are notoriously expensive and time-consuming to acquire. Semi-supervised semantic segmentation (SSS) offers a promising alternative to mitigate this data dependency. However, existing SSS methods often struggle with the inherent distribution mismatch between limited labeled data and abundant unlabeled data, leading to suboptimal generalization. To alleviate this issue, we attempt to introduce the Vision Foundation Models (VFMs) pre-trained on vast and diverse datasets into the SSS task since VFMs possess robust generalization capabilities that can effectively bridge this distribution gap and provide strong semantic priors for SSS. Inspired by this, we introduce RS-MTDF (Multi-Teacher Distillation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Neural Network Applications · Automated Road and Building Extraction
MethodsALIGN
