Explainability of Vision Transformers: A Comprehensive Review and New Perspectives
Rojina Kashefi, Leili Barekatain, Mohammad Sabokrou, Fatemeh, Aghaeipoor

TL;DR
This paper reviews the explainability methods for vision transformers, categorizes them, evaluates tools and criteria, and discusses future research directions to improve understanding and trust in these models.
Contribution
It provides a comprehensive taxonomy and review of explainability techniques for vision transformers, highlighting unexplored areas and proposing future research directions.
Findings
Taxonomy of explainability methods based on motivation and structure
Evaluation criteria and tools for explanation quality
Identification of unexplored aspects and future research directions
Abstract
Transformers have had a significant impact on natural language processing and have recently demonstrated their potential in computer vision. They have shown promising results over convolution neural networks in fundamental computer vision tasks. However, the scientific community has not fully grasped the inner workings of vision transformers, nor the basis for their decision-making, which underscores the importance of explainability methods. Understanding how these models arrive at their decisions not only improves their performance but also builds trust in AI systems. This study explores different explainability methods proposed for visual transformers and presents a taxonomy for organizing them according to their motivations, structures, and application scenarios. In addition, it provides a comprehensive review of evaluation criteria that can be used for comparing explanation results,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Machine Learning in Materials Science
MethodsConvolution
