DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification
Muhammad Ahmad, Manuel Mazzara, Salvatore Distefano, Adil Mehmood, Khan, Silvia Liberata Ullo

TL;DR
DiffFormer introduces a novel differential spatial-spectral transformer architecture that significantly improves hyperspectral image classification accuracy and efficiency by emphasizing subtle spectral-spatial variations.
Contribution
The paper proposes DiffFormer, a new transformer-based framework with differential multi-head self-attention and 3D convolutional tokenization for hyperspectral image classification, addressing spectral redundancy and spatial discontinuity.
Findings
Outperforms existing SOTA methods in accuracy
Demonstrates high computational efficiency and scalability
Provides detailed analysis of model complexity
Abstract
Hyperspectral image classification (HSIC) has gained significant attention because of its potential in analyzing high-dimensional data with rich spectral and spatial information. In this work, we propose the Differential Spatial-Spectral Transformer (DiffFormer), a novel framework designed to address the inherent challenges of HSIC, such as spectral redundancy and spatial discontinuity. The DiffFormer leverages a Differential Multi-Head Self-Attention (DMHSA) mechanism, which enhances local feature discrimination by introducing differential attention to accentuate subtle variations across neighboring spectral-spatial patches. The architecture integrates Spectral-Spatial Tokenization through three-dimensional (3D) convolution-based patch embeddings, positional encoding, and a stack of transformer layers equipped with the SWiGLU activation function for efficient feature extraction (SwiGLU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Adam
