Tampered VAE for Improved Satellite Image Time Series Classification
Xin Cai, Yaxin Bi, Peter Nicholl

TL;DR
This paper introduces a semi-supervised VAE framework combined with a temporal transformer model for satellite image time series classification, achieving high accuracy with less memory and limited labeled data.
Contribution
It presents a novel VAE-based semi-supervised learning approach integrated with a temporal transformer, reducing memory use and improving crop classification performance.
Findings
Superior classification accuracy with limited labeled data.
Reduced GPU memory consumption compared to spatial-temporal models.
Latent space clustering enhances interpretability.
Abstract
The unprecedented availability of spatial and temporal high-resolution satellite image time series (SITS) for crop type mapping is believed to necessitate deep learning architectures to accommodate challenges arising from both dimensions. Recent state-of-the-art deep learning models have shown promising results by stacking spatial and temporal encoders. However, we present a Pyramid Time-Series Transformer (PTST) that operates solely on the temporal dimension, i.e., neglecting the spatial dimension, can produce superior results with a drastic reduction in GPU memory consumption and easy extensibility. Furthermore, we augment it to perform semi-supervised learning by proposing a classification-friendly VAE framework that introduces clustering mechanisms into latent space and can promote linear separability therein. Consequently, a few principal axes of the latent space can explain the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing in Agriculture · Time Series Analysis and Forecasting · Spectroscopy and Chemometric Analyses
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Softmax · Absolute Position Encodings · Layer Normalization · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections
