SSFT: A Lightweight Spectral-Spatial Fusion Transformer for Generic Hyperspectral Classification
Alexander Musiat, Nikolas Ebert, Oliver Wasenm\"uller

TL;DR
The paper introduces SSFT, a lightweight spectral-spatial transformer for hyperspectral classification, achieving state-of-the-art results with fewer parameters across diverse datasets.
Contribution
It proposes a novel spectral-spatial fusion transformer that is compact, effective, and adaptable to various hyperspectral classification tasks.
Findings
SSFT ranks first on the HSI-Benchmark with less than 2% of previous model parameters.
SSFT remains competitive on the larger SpectralEarth benchmark despite its small size.
Both spectral and spatial pathways are essential, with spatial modeling being most impactful.
Abstract
Hyperspectral imaging enables fine-grained recognition of materials by capturing rich spectral signatures, but learning robust classifiers is challenging due to high dimensionality, spectral redundancy, limited labeled data, and strong domain shifts. Beyond earth observation, labeled HSI data is often scarce and imbalanced, motivating compact models for generic hyperspectral classification across diverse acquisition regimes. We propose the lightweight Spectral-Spatial Fusion Transformer (SSFT), which factorizes representation learning into spectral and spatial pathways and integrates them via cross-attention to capture complementary wavelength-dependent and structural information. We evaluate our SSFT on the challenging HSI-Benchmark, a heterogeneous multi-dataset benchmark covering earth observation, fruit condition assessment, and fine-grained material recognition. SSFT achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
