LoLA-SpecViT: Local Attention SwiGLU Vision Transformer with LoRA for Hyperspectral Imaging
Fadi Abdeladhim Zidi, Djamel Eddine Boukhari, Abdellah Zakaria Sellam, Abdelkrim Ouafi, Cosimo Distante, Salah Eddine Bekhouche, Abdelmalik Taleb-Ahmed

TL;DR
LoLA-SpecViT is a lightweight, parameter-efficient hyperspectral image classifier that combines local attention, spectral-spatial feature extraction, and low-rank adaptation to outperform state-of-the-art methods with fewer parameters.
Contribution
The paper introduces LoLA-SpecViT, a novel spectral vision transformer with local attention and LoRA, tailored for hyperspectral imaging, enabling high accuracy with fewer trainable parameters.
Findings
Achieves up to 99.91% accuracy on benchmark datasets.
Uses 80% fewer trainable parameters than comparable models.
Demonstrates robustness under low-label conditions.
Abstract
Hyperspectral image classification remains a challenging task due to the high dimensionality of spectral data, significant inter-band redundancy, and the limited availability of annotated samples. While recent transformer-based models have improved the global modeling of spectral-spatial dependencies, their scalability and adaptability under label-scarce conditions remain limited. In this work, we propose \textbf{LoLA-SpecViT}(Low-rank adaptation Local Attention Spectral Vision Transformer), a lightweight spectral vision transformer that addresses these limitations through a parameter-efficient architecture tailored to the unique characteristics of hyperspectral imagery. Our model combines a 3D convolutional spectral front-end with local window-based self-attention, enhancing both spectral feature extraction and spatial consistency while reducing computational complexity. To further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
