Cosine-Normalized Attention for Hyperspectral Image Classification

Muhammad Ahmad; Manuel Mazzara

arXiv:2604.01763·cs.CV·April 3, 2026

Cosine-Normalized Attention for Hyperspectral Image Classification

Muhammad Ahmad, Manuel Mazzara

PDF

TL;DR

This paper introduces a cosine-normalized attention mechanism for hyperspectral image classification, emphasizing angular relationships to improve performance over traditional dot-product methods.

Contribution

It proposes a geometric, cosine-based attention formulation integrated into a Transformer, enhancing hyperspectral classification especially under limited supervision.

Findings

01

Consistently outperforms recent Transformer-based models on benchmark datasets.

02

Emphasizes angular relationships, reducing sensitivity to feature magnitude variations.

03

Provides a reliable inductive bias for hyperspectral data representation.

Abstract

Transformer-based methods have improved hyperspectral image classification (HSIC) by modeling long-range spatial-spectral dependencies; however, their attention mechanisms typically rely on dot-product similarity, which mixes feature magnitude and orientation and may be suboptimal for hyperspectral data. This work revisits attention scoring from a geometric perspective and introduces a cosine-normalized attention formulation that aligns similarity computation with the angular structure of hyperspectral signatures. By projecting query and key embeddings onto a unit hypersphere and applying a squared cosine similarity, the proposed method emphasizes angular relationships while reducing sensitivity to magnitude variations. The formulation is integrated into a spatial-spectral Transformer and evaluated under extremely limited supervision. Experiments on three benchmark datasets demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.