DSXFormer: Dual-Pooling Spectral Squeeze-Expansion and Dynamic Context Attention Transformer for Hyperspectral Image Classification

Farhan Ullah; Irfan Ullah; Khalil Khan; Giovanni Pau; JaKeoung Koo

arXiv:2602.01906·cs.CV·February 3, 2026

DSXFormer: Dual-Pooling Spectral Squeeze-Expansion and Dynamic Context Attention Transformer for Hyperspectral Image Classification

Farhan Ullah, Irfan Ullah, Khalil Khan, Giovanni Pau, JaKeoung Koo

PDF

Open Access

TL;DR

DSXFormer is a novel transformer model for hyperspectral image classification that combines dual-pooling spectral feature calibration with dynamic local context attention, achieving high accuracy and efficiency.

Contribution

Introduces DSXFormer, integrating spectral dual-pooling squeeze-expansion and dynamic context attention to improve spectral discriminability and computational efficiency in HSIC.

Findings

01

Achieves state-of-the-art accuracy on four benchmark datasets.

02

Effectively balances spectral emphasis and spatial context.

03

Reduces computational overhead compared to existing methods.

Abstract

Hyperspectral image classification (HSIC) is a challenging task due to high spectral dimensionality, complex spectral-spatial correlations, and limited labeled training samples. Although transformer-based models have shown strong potential for HSIC, existing approaches often struggle to achieve sufficient spectral discriminability while maintaining computational efficiency. To address these limitations, we propose a novel DSXFormer, a novel dual-pooling spectral squeeze-expansion transformer with Dynamic Context Attention for HSIC. The proposed DSXFormer introduces a Dual-Pooling Spectral Squeeze-Expansion (DSX) block, which exploits complementary global average and max pooling to adaptively recalibrate spectral feature channels, thereby enhancing spectral discriminability and inter-band dependency modeling. In addition, DSXFormer incorporates a Dynamic Context Attention (DCA) mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote-Sensing Image Classification · Domain Adaptation and Few-Shot Learning · Face and Expression Recognition