Exploring DINO: Emergent Properties and Limitations for Synthetic Aperture Radar Imagery
Joseph A. Gallego-Mejia, Anna Jungbluth, Laura Mart\'inez-Ferrer, Matt, Allen, Francisco Dorr, Freddie Kalaitzis, Ra\'ul Ramos-Poll\'an

TL;DR
This paper investigates the properties and limitations of the DINO self-supervised learning algorithm applied to SAR imagery, highlighting its potential for remote sensing tasks and the value of attention maps.
Contribution
It demonstrates the application of DINO to SAR data, evaluates attention maps for land cover segmentation, and discusses the limitations and future opportunities of SSL in remote sensing.
Findings
Small performance gains from pre-training over training from scratch
Attention maps have intrinsic value for remote sensing applications
SSL models like DINO can provide useful inputs for land cover analysis
Abstract
Self-supervised learning (SSL) models have recently demonstrated remarkable performance across various tasks, including image segmentation. This study delves into the emergent characteristics of the Self-Distillation with No Labels (DINO) algorithm and its application to Synthetic Aperture Radar (SAR) imagery. We pre-train a vision transformer (ViT)-based DINO model using unlabeled SAR data, and later fine-tune the model to predict high-resolution land cover maps. We rigorously evaluate the utility of attention maps generated by the ViT backbone and compare them with the model's token embedding space. We observe a small improvement in model performance with pre-training compared to training from scratch and discuss the limitations and opportunities of SSL for remote sensing and land cover segmentation. Beyond small performance increases, we show that ViT attention maps hold great…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Underwater Acoustics Research · Synthetic Aperture Radar (SAR) Applications and Techniques
MethodsAttention Is All You Need · Softmax · Linear Layer · Residual Connection · Multi-Head Attention · Layer Normalization · Dense Connections · Vision Transformer · self-DIstillation with NO labels
