Restricting the Flow: Information Bottlenecks for Attribution

Karl Schulz; Leon Sixt; Federico Tombari; Tim Landgraf

arXiv:2001.00396·stat.ML·May 26, 2020·55 cites

Restricting the Flow: Information Bottlenecks for Attribution

Karl Schulz, Leon Sixt, Federico Tombari, Tim Landgraf

PDF

Open Access 4 Repos

TL;DR

This paper introduces an information bottleneck approach for attribution in neural networks, quantifying the importance of input regions in bits and outperforming existing methods across multiple metrics.

Contribution

It adapts the information bottleneck concept for attribution, providing an information-theoretic measure and guarantees for input region importance in neural network decisions.

Findings

01

Outperforms ten baselines in most settings

02

Provides an absolute information measure in bits

03

Guarantees non-essential regions have near-zero relevance

Abstract

Attribution methods provide insights into the decision-making of machine learning models like artificial neural networks. For a given input sample, they assign a relevance score to each individual input variable, such as the pixels of an image. In this work we adapt the information bottleneck concept for attribution. By adding noise to intermediate feature maps we restrict the flow of information and can quantify (in bits) how much information image regions provide. We compare our method against ten baselines using three different metrics on VGG-16 and ResNet-50, and find that our methods outperform all baselines in five out of six settings. The method's information-theoretic foundation provides an absolute frame of reference for attribution values (bits) and a guarantee that regions scored close to zero are not necessary for the network's decision. For reviews:…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications