TL;DR
This paper introduces AeroRIT, a large-scale hyperspectral dataset with new categories like buildings and cars, and benchmarks CNN architectures for scene understanding and object segmentation.
Contribution
It presents AeroRIT, the first comprehensive large-scale hyperspectral scene with extensive annotations, and evaluates CNN models with enhancements for improved aerial scene analysis.
Findings
SegNet, U-Net, and Res-U-Net establish baseline performance.
Adding squeeze and excitation blocks improves segmentation accuracy.
Self-supervised learning enhances encoder initialization.
Abstract
We investigate applying convolutional neural network (CNN) architecture to facilitate aerial hyperspectral scene understanding and present a new hyperspectral dataset-AeroRIT-that is large enough for CNN training. To date the majority of hyperspectral airborne have been confined to various sub-categories of vegetation and roads and this scene introduces two new categories: buildings and cars. To the best of our knowledge, this is the first comprehensive large-scale hyperspectral scene with nearly seven million pixel annotations for identifying cars, roads, and buildings. We compare the performance of three popular architectures - SegNet, U-Net, and Res-U-Net, for scene understanding and object identification via the task of dense semantic segmentation to establish a benchmark for the scene. To further strengthen the network, we add squeeze and excitation blocks for better channel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConcatenated Skip Connection · U-Net · Convolution · Kaiming Initialization · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · SegNet
