Consecutive Pretraining: A Knowledge Transfer Learning Strategy with   Relevant Unlabeled Data for Remote Sensing Domain

Tong Zhang; Peng Gao; Hao Dong; Yin Zhuang; Guanqun Wang; Wei Zhang,; He Chen

arXiv:2207.03860·cs.CV·September 15, 2022

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain

Tong Zhang, Peng Gao, Hao Dong, Yin Zhuang, Guanqun Wang, Wei Zhang,, He Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces ConSecutive PreTraining (CSPT), a self-supervised knowledge transfer strategy using vision transformers, which effectively bridges the domain gap in remote sensing and improves task performance without extensive labeling.

Contribution

The paper proposes CSPT, a novel self-supervised pretraining approach inspired by NLP, tailored for remote sensing, leveraging unlabeled data and vision transformers to enhance downstream task accuracy.

Findings

01

CSPT outperforms supervised pretraining-then-fine-tuning methods.

02

Almost all remote sensing tasks achieve state-of-the-art results with CSPT.

03

CSPT reduces reliance on labeled data and domain-specific large datasets.

Abstract

Currently, under supervised learning, a model pretrained by a large-scale nature scene dataset and then fine-tuned on a few specific task labeling data is the paradigm that has dominated the knowledge transfer learning. It has reached the status of consensus solution for task-aware model training in remote sensing domain (RSD). Unfortunately, due to different categories of imaging data and stiff challenges of data annotation, there is not a large enough and uniform remote sensing dataset to support large-scale pretraining in RSD. Moreover, pretraining models on large-scale nature scene datasets by supervised learning and then directly fine-tuning on diverse downstream tasks seems to be a crude method, which is easily affected by inevitable labeling noise, severe domain gaps and task-aware discrepancies. Thus, in this paper, considering the self-supervised pretraining and powerful vision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangtong1/transfer_learning_cspt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Softmax · Multi-Head Attention · Residual Connection · Dense Connections · Vision Transformer