SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models

Jonathan Roberts; Kai Han; Samuel Albanie

arXiv:2304.11619 · cs.CV · April 25, 2023 · 6 cites

TL;DR

SATIN is a comprehensive multi-task metadataset derived from 27 satellite imagery datasets, designed to evaluate vision-language models' zero-shot classification across Earth's geographic diversity, highlighting current challenges and progress in remote sensing interpretation.

Contribution

This work introduces SATIN, the first large-scale, diverse satellite imagery metadataset, and evaluates vision-language models' zero-shot transfer capabilities on this challenging benchmark.

Findings

01. The strongest evaluated model achieves 52.0% zero-shot classification accuracy.

02. SATIN is a challenging benchmark for remote sensing classification.

03. A public leaderboard is provided to track model progress.

Abstract

Interpreting remote sensing imagery enables numerous downstream applications ranging from land-use planning to deforestation monitoring. Robustly classifying this data is challenging due to the Earth's geographic diversity. While many distinct satellite and aerial image classification datasets exist, there is yet to be a benchmark curated that suitably covers this diversity. In this work, we introduce SATellite ImageNet (SATIN), a metadataset curated from 27 existing remotely sensed datasets, and comprehensively evaluate the zero-shot transfer classification capabilities of a broad range of vision-language (VL) models on SATIN. We find SATIN to be a challenging benchmark: the strongest method we evaluate achieves a classification accuracy of 52.0%. We provide a public leaderboard to guide and track the progress of VL models in this…
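To make the zero-shot setting concrete, the following is a minimal sketch of how a VL model is typically scored on a classification task: each image embedding is compared with the text embeddings of the candidate class names, and the most similar class is taken as the prediction. The embeddings, class names, and helper functions here are toy values invented for illustration; they are not taken from SATIN or from the paper's evaluation code, and a real model would produce much higher-dimensional embeddings.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def zero_shot_predict(image_emb, class_embs):
    """Return the class whose text embedding is most similar to the image embedding."""
    return max(class_embs, key=lambda name: cosine(image_emb, class_embs[name]))


def accuracy(predictions, labels):
    """Fraction of predictions that match the ground-truth labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Hypothetical text embeddings for three class prompts (assumed values).
class_embs = {
    "forest": [1.0, 0.0, 0.0],
    "river": [0.0, 1.0, 0.0],
    "airport": [0.0, 0.0, 1.0],
}

# Hypothetical (image embedding, true label) pairs (assumed values).
images = [
    ([0.9, 0.1, 0.0], "forest"),
    ([0.2, 0.8, 0.1], "river"),
    ([0.6, 0.1, 0.5], "airport"),  # closer to "forest", so it is misclassified
    ([0.1, 0.0, 0.9], "airport"),
]

preds = [zero_shot_predict(emb, class_embs) for emb, _ in images]
acc = accuracy(preds, [label for _, label in images])
print(preds, acc)
```

No task-specific training happens at any point in this sketch; the class names alone define the classifier, which is what makes the evaluation zero-shot.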

Code & Models

Datasets

jonathan-roberts1/SATIN
dataset · 74 downloads

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Genomics and Phylogenetic Studies · Advanced Image and Video Retrieval Techniques