Decision-Aware Attention Propagation for Vision Transformer Explainability
Sehyeong Jo, Gangjae Jang, Haesol Park

TL;DR
This paper introduces Decision-Aware Attention Propagation (DAP), a novel method that enhances Vision Transformer interpretability by integrating decision-relevant information into attention mechanisms, resulting in more accurate and class-sensitive explanations.
Contribution
The paper proposes DAP, a new attribution technique that combines gradient-based token importance with attention propagation to improve ViT explanation quality.
Findings
DAP produces more class discriminative attribution maps.
DAP outperforms existing attention-based explanation methods.
Experiments show DAP's effectiveness across various ViT models.
Abstract
Vision Transformers (ViTs) have become a dominant architecture in computer vision, yet their prediction process remains difficult to interpret because information is propagated through complex interactions across layers and attention heads. Existing attention based explanation methods provide an intuitive way to trace information flow. However, they rely mainly on raw attention weights, which do not explicitly reflect the final decision and often lead to explanations with limited class discriminability. In contrast, gradient based localization methods are more effective at highlighting class specific evidence, but they do not fully exploit the hierarchical attention propagation mechanism of transformers. To address this limitation, we propose Decision-Aware Attention Propagation (DAP), an attribution method that injects decision-relevant priors into transformer attention propagation. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
