DSXFormer: Dual-Pooling Spectral Squeeze-Expansion and Dynamic Context Attention Transformer for Hyperspectral Image Classification
Farhan Ullah, Irfan Ullah, Khalil Khan, Giovanni Pau, JaKeoung Koo

TL;DR
DSXFormer is a novel transformer model for hyperspectral image classification that combines dual-pooling spectral feature calibration with dynamic local context attention, achieving high accuracy and efficiency.
Contribution
Introduces DSXFormer, integrating spectral dual-pooling squeeze-expansion and dynamic context attention to improve spectral discriminability and computational efficiency in HSIC.
Findings
Achieves state-of-the-art accuracy on four benchmark datasets.
Effectively balances spectral emphasis and spatial context.
Reduces computational overhead compared to existing methods.
Abstract
Hyperspectral image classification (HSIC) is a challenging task due to high spectral dimensionality, complex spectral-spatial correlations, and limited labeled training samples. Although transformer-based models have shown strong potential for HSIC, existing approaches often struggle to achieve sufficient spectral discriminability while maintaining computational efficiency. To address these limitations, we propose a novel DSXFormer, a novel dual-pooling spectral squeeze-expansion transformer with Dynamic Context Attention for HSIC. The proposed DSXFormer introduces a Dual-Pooling Spectral Squeeze-Expansion (DSX) block, which exploits complementary global average and max pooling to adaptively recalibrate spectral feature channels, thereby enhancing spectral discriminability and inter-band dependency modeling. In addition, DSXFormer incorporates a Dynamic Context Attention (DCA) mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition
