FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pretraining
Shaheer Mohamed, Maryam Haghighat, Tharindu Fernando, Sridha, Sridharan, Clinton Fookes, Peyman Moghadam

TL;DR
FactoFormer introduces a factorized hyperspectral transformer with self-supervised pretraining, effectively capturing spectral and spatial information, leading to state-of-the-art classification performance on multiple datasets.
Contribution
The paper proposes a novel factorized spectral-spatial transformer with self-supervised pretraining for hyperspectral images, addressing limitations of previous models and enhancing performance.
Findings
Achieves state-of-the-art results on six hyperspectral datasets.
Effectively captures spectral and spatial interactions in HSI data.
Demonstrates the benefits of factorized and self-supervised pretraining strategies.
Abstract
Hyperspectral images (HSIs) contain rich spectral and spatial information. Motivated by the success of transformers in the field of natural language processing and computer vision where they have shown the ability to learn long range dependencies within input data, recent research has focused on using transformers for HSIs. However, current state-of-the-art hyperspectral transformers only tokenize the input HSI sample along the spectral dimension, resulting in the under-utilization of spatial information. Moreover, transformers are known to be data-hungry and their performance relies heavily on large-scale pretraining, which is challenging due to limited annotated hyperspectral data. Therefore, the full potential of HSI transformers has not been fully realized. To overcome these limitations, we propose a novel factorized spectral-spatial transformer that incorporates factorized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Remote Sensing and Land Use · Image Retrieval and Classification Techniques
