LoLA-SpecViT: Local Attention SwiGLU Vision Transformer with LoRA for Hyperspectral Imaging

Fadi Abdeladhim Zidi; Djamel Eddine Boukhari; Abdellah Zakaria Sellam; Abdelkrim Ouafi; Cosimo Distante; Salah Eddine Bekhouche; Abdelmalik Taleb-Ahmed

arXiv:2506.17759·cs.CV·November 4, 2025

LoLA-SpecViT: Local Attention SwiGLU Vision Transformer with LoRA for Hyperspectral Imaging

Fadi Abdeladhim Zidi, Djamel Eddine Boukhari, Abdellah Zakaria Sellam, Abdelkrim Ouafi, Cosimo Distante, Salah Eddine Bekhouche, Abdelmalik Taleb-Ahmed

PDF

TL;DR

LoLA-SpecViT is a lightweight, parameter-efficient hyperspectral image classifier that combines local attention, spectral-spatial feature extraction, and low-rank adaptation to outperform state-of-the-art methods with fewer parameters.

Contribution

The paper introduces LoLA-SpecViT, a novel spectral vision transformer with local attention and LoRA, tailored for hyperspectral imaging, enabling high accuracy with fewer trainable parameters.

Findings

01

Achieves up to 99.91% accuracy on benchmark datasets.

02

Uses 80% fewer trainable parameters than comparable models.

03

Demonstrates robustness under low-label conditions.

Abstract

Hyperspectral image classification remains a challenging task due to the high dimensionality of spectral data, significant inter-band redundancy, and the limited availability of annotated samples. While recent transformer-based models have improved the global modeling of spectral-spatial dependencies, their scalability and adaptability under label-scarce conditions remain limited. In this work, we propose \textbf{LoLA-SpecViT}(Low-rank adaptation Local Attention Spectral Vision Transformer), a lightweight spectral vision transformer that addresses these limitations through a parameter-efficient architecture tailored to the unique characteristics of hyperspectral imagery. Our model combines a 3D convolutional spectral front-end with local window-based self-attention, enhancing both spectral feature extraction and spatial consistency while reducing computational complexity. To further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.