Interpretable Vision Transformers in Image Classification via SVDA

Vasileios Arampatzakis; George Pavlidis; Nikolaos Mitianoudis; Nikos Papamarkos

arXiv:2602.10994·cs.CV·February 12, 2026

Interpretable Vision Transformers in Image Classification via SVDA

Vasileios Arampatzakis, George Pavlidis, Nikolaos Mitianoudis, Nikos Papamarkos

PDF

Open Access

TL;DR

This paper introduces SVDA, a geometrically grounded attention mechanism for Vision Transformers that improves interpretability and spectral structure without reducing classification performance across multiple benchmarks.

Contribution

We adapt SVDA to Vision Transformers, enhancing interpretability and spectral structure while maintaining accuracy, and demonstrate its effectiveness on standard image classification benchmarks.

Findings

01

SVDA produces more interpretable attention patterns.

02

SVDA maintains classification accuracy across benchmarks.

03

SVDA provides spectral diagnostics for attention models.

Abstract

Vision Transformers (ViTs) have achieved state-of-the-art performance in image classification, yet their attention mechanisms often remain opaque and exhibit dense, non-structured behaviors. In this work, we adapt our previously proposed SVD-Inspired Attention (SVDA) mechanism to the ViT architecture, introducing a geometrically grounded formulation that enhances interpretability, sparsity, and spectral structure. We apply the use of interpretability indicators -- originally proposed with SVDA -- to monitor attention dynamics during training and assess structural properties of the learned representations. Experimental evaluations on four widely used benchmarks -- CIFAR-10, FashionMNIST, CIFAR-100, and ImageNet-100 -- demonstrate that SVDA consistently yields more interpretable attention patterns without sacrificing classification accuracy. While the current framework offers descriptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning