Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing
Yi Wang, Hugo Hern\'andez Hern\'andez, Conrad M Albrecht, Xiao Xiang, Zhu

TL;DR
This paper introduces FG-MAE, a self-supervised learning method that improves masked autoencoders by reconstructing spectral and spatial features, enhancing semantic understanding especially in noisy SAR images.
Contribution
The paper proposes a novel feature-guided masked autoencoder that reconstructs spectral and spatial features, outperforming traditional pixel-based methods in remote sensing tasks.
Findings
FG-MAE outperforms traditional MAE in remote sensing tasks.
Reconstructing spectral and spatial features improves semantic understanding.
Pretrained vision transformers demonstrate scalability and effectiveness.
Abstract
Self-supervised learning guided by masked image modelling, such as Masked AutoEncoder (MAE), has attracted wide attention for pretraining vision transformers in remote sensing. However, MAE tends to excessively focus on pixel details, thereby limiting the model's capacity for semantic understanding, in particular for noisy SAR images. In this paper, we explore spectral and spatial remote sensing image features as improved MAE-reconstruction targets. We first conduct a study on reconstructing various image features, all performing comparably well or better than raw pixels. Based on such observations, we propose Feature Guided Masked Autoencoder (FG-MAE): reconstructing a combination of Histograms of Oriented Graidents (HOG) and Normalized Difference Indices (NDI) for multispectral images, and reconstructing HOG for SAR images. Experimental results on three downstream tasks illustrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced SAR Imaging Techniques · Remote-Sensing Image Classification
MethodsMasked autoencoder · Focus
