Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang,, He Chen

TL;DR
This paper introduces ConSecutive PreTraining (CSPT), a self-supervised knowledge transfer strategy using vision transformers, which effectively bridges the domain gap in remote sensing and improves task performance without extensive labeling.
Contribution
The paper proposes CSPT, a novel self-supervised pretraining approach inspired by NLP, tailored for remote sensing, leveraging unlabeled data and vision transformers to enhance downstream task accuracy.
Findings
CSPT outperforms supervised pretraining-then-fine-tuning methods.
Almost all remote sensing tasks achieve state-of-the-art results with CSPT.
CSPT reduces reliance on labeled data and domain-specific large datasets.
Abstract
Currently, under supervised learning, a model pretrained by a large-scale nature scene dataset and then fine-tuned on a few specific task labeling data is the paradigm that has dominated the knowledge transfer learning. It has reached the status of consensus solution for task-aware model training in remote sensing domain (RSD). Unfortunately, due to different categories of imaging data and stiff challenges of data annotation, there is not a large enough and uniform remote sensing dataset to support large-scale pretraining in RSD. Moreover, pretraining models on large-scale nature scene datasets by supervised learning and then directly fine-tuning on diverse downstream tasks seems to be a crude method, which is easily affected by inevitable labeling noise, severe domain gaps and task-aware discrepancies. Thus, in this paper, considering the self-supervised pretraining and powerful vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Multi-Head Attention · Residual Connection · Dense Connections · Vision Transformer
