iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images
Syed Waqas Zamir, Aditya Arora, Akshita Gupta, Salman Khan, Guolei, Sun, Fahad Shahbaz Khan, Fan Zhu, Ling Shao, Gui-Song Xia, Xiang Bai

TL;DR
The paper introduces iSAID, a large-scale, densely annotated dataset for instance segmentation in aerial images, addressing unique challenges like numerous tiny objects and large-scale variations, and benchmarks existing methods on this new dataset.
Contribution
The creation of the first comprehensive dataset for instance segmentation in aerial imagery, with extensive annotations and a benchmark for evaluating segmentation methods.
Findings
Existing methods perform suboptimally on aerial images.
iSAID contains 15 times more categories than previous datasets.
Specialized solutions are needed for aerial image instance segmentation.
Abstract
Existing Earth Vision datasets are either suitable for semantic segmentation or object detection. In this work, we introduce the first benchmark dataset for instance segmentation in aerial imagery that combines instance-level object detection and pixel-level segmentation tasks. In comparison to instance segmentation in natural scenes, aerial images present unique challenges e.g., a huge number of instances per image, large object-scale variations and abundant tiny objects. Our large-scale and densely annotated Instance Segmentation in Aerial Images Dataset (iSAID) comes with 655,451 object instances for 15 categories across 2,806 high-resolution images. Such precise per-pixel annotations for each instance ensure accurate localization that is essential for detailed scene analysis. Compared to existing small-scale aerial image based instance segmentation datasets, iSAID contains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
Methods1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Feature Pyramid Network · Region Proposal Network · Bottom-up Path Augmentation · PAFPN · Adaptive Feature Pooling · Dense Connections · PANet · Softmax
