Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers

Jung-Ho Hong; Ho-Joong Kim; Kyu-Sung Jeon; Seong-Whan Lee

arXiv:2507.04388·cs.CV·July 8, 2025

Comprehensive Information Bottleneck for Unveiling Universal Attribution to Interpret Vision Transformers

Jung-Ho Hong, Ho-Joong Kim, Kyu-Sung Jeon, Seong-Whan Lee

PDF

TL;DR

This paper introduces CoIBA, a comprehensive information bottleneck method that explains vision transformer decisions by capturing relevant information across multiple layers, improving attribution faithfulness.

Contribution

It proposes a novel multi-layer information bottleneck approach with shared damping ratio to better explain decision processes in vision transformers.

Findings

01

Enhanced attribution faithfulness demonstrated in experiments

02

Effective discovery of decision-relevant information across layers

03

Shared damping ratio improves information sharing across layers

Abstract

The feature attribution method reveals the contribution of input variables to the decision-making process to provide an attribution map for explanation. Existing methods grounded on the information bottleneck principle compute information in a specific layer to obtain attributions, compressing the features by injecting noise via a parametric damping ratio. However, the attribution obtained in a specific layer neglects evidence of the decision-making process distributed across layers. In this paper, we introduce a comprehensive information bottleneck (CoIBA), which discovers the relevant information in each targeted layer to explain the decision-making process. Our core idea is applying information bottleneck in multiple targeted layers to estimate the comprehensive information by sharing a parametric damping ratio across the layers. Leveraging this shared ratio complements the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.