GMM-Based Synthetic Samples for Classification of Hyperspectral Images With Limited Training Data
AmirAbbas Davari, Erchan Aptoula, Berrin Yanikoglu, Andreas Maier,, Christian Riess

TL;DR
This paper proposes a GMM-based synthetic sample generation method to improve hyperspectral image classification with limited training data, demonstrating a median 5% performance gain and stability across sample variations.
Contribution
The study introduces a novel GMM-based data augmentation technique for hyperspectral classification with limited training samples, showing its effectiveness in real-world scenarios.
Findings
Median 5% increase in classification accuracy
Stable performance gain across different sample sizes
Effective augmentation with moderate synthetic data
Abstract
The amount of training data that is required to train a classifier scales with the dimensionality of the feature data. In hyperspectral remote sensing, feature data can potentially become very high dimensional. However, the amount of training data is oftentimes limited. Thus, one of the core challenges in hyperspectral remote sensing is how to perform multi-class classification using only relatively few training data points. In this work, we address this issue by enriching the feature matrix with synthetically generated sample points. This synthetic data is sampled from a GMM fitted to each class of the limited training data. Although, the true distribution of features may not be perfectly modeled by the fitted GMM, we demonstrate that a moderate augmentation by these synthetic samples can effectively replace a part of the missing training samples. We show the efficacy of the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
