Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery
Mubashir Noman, Muzammal Naseer, Hisham Cholakkal, Rao Muhammad Anwar,, Salman Khan, Fahad Shahbaz Khan

TL;DR
This paper introduces SatMAE++, a multi-scale, multi-modal transformer pre-training method for remote sensing imagery that significantly improves performance across various datasets by effectively utilizing scale and modality information.
Contribution
The paper proposes SatMAE++, a novel multi-scale, multi-modal transformer pre-training approach that enhances remote sensing image analysis by incorporating scale and modality information.
Findings
Achieves state-of-the-art results on six remote sensing datasets.
Gains 2.5% mAP on BigEarthNet for multi-label classification.
Effective for both optical and multi-spectral imagery.
Abstract
Recent advances in unsupervised learning have demonstrated the ability of large vision models to achieve promising results on downstream tasks by pre-training on large amount of unlabelled data. Such pre-training techniques have also been explored recently in the remote sensing domain due to the availability of large amount of unlabelled data. Different from standard natural image datasets, remote sensing data is acquired from various sensor technologies and exhibit diverse range of scale variations as well as modalities. Existing satellite image pre-training methods either ignore the scale information present in the remote sensing imagery or restrict themselves to use only a single type of data modality. In this paper, we re-visit transformers pre-training and leverage multi-scale information that is effectively utilized with multiple modalities. Our proposed approach, named SatMAE++,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗mubashir04/checkpoint_ViT-L_pretrain_fmow_sentinelmodel
- 🤗mubashir04/checkpoint_ViT-L_pretrain_fmow_rgbmodel· ♡ 1♡ 1
- 🤗mubashir04/checkpoint_ViT-L_finetune_fmow_sentinelmodel· ♡ 2♡ 2
- 🤗mubashir04/checkpoint_ViT-L_finetune_fmow_rgbmodel· ♡ 1♡ 1
- 🤗MVRL/satmaepp_ViT-L_pretrain_fmow_rgbmodel· 138 dl· ♡ 1138 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping · Remote-Sensing Image Classification · Advanced Image Fusion Techniques
MethodsConvolution
