TL;DR
This paper demonstrates that a simple shallow CNN with three specific tricks can achieve state-of-the-art hyperspectral image classification performance with very few labeled pixels, especially when using a new disjoint train-test sampling method.
Contribution
It introduces three tricks for shallow CNNs, a new label-based sampling method, and evaluates the approach on five datasets, showing significant performance improvements with minimal labeled data.
Findings
Achieves up to 99.52% accuracy with 1% labeled pixels
Outperforms baseline methods on five hyperspectral datasets
Highlights the importance of using entire image data for training and testing
Abstract
Spectral-spatial classification of hyperspectral images has been the subject of many studies in recent years. In the presence of only very few labeled pixels, this task becomes challenging. In this paper we address the following two research questions: 1) Can a simple neural network with just a single hidden layer achieve state of the art performance in the presence of few labeled pixels? 2) How is the performance of hyperspectral image classification methods affected when using disjoint train and test sets? We give a positive answer to the first question by using three tricks within a very basic shallow Convolutional Neural Network (CNN) architecture: a tailored loss function, and smooth- and label-based data augmentation. The tailored loss function enforces that neighborhood wavelengths have similar contributions to the features generated during training. A new label-based technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
