Enhancing Neural Network Interpretability Through Conductance-Based   Information Plane Analysis

Jaouad Dabounou; Amine Baazzouz

arXiv:2408.14681·cs.LG·August 28, 2024

Enhancing Neural Network Interpretability Through Conductance-Based Information Plane Analysis

Jaouad Dabounou, Amine Baazzouz

PDF

Open Access

TL;DR

This paper introduces a conductance-based method to analyze neural network information flow, providing more precise insights into layer contributions and challenging existing theories, thereby enhancing interpretability and robustness.

Contribution

It proposes a novel conductance-based Information Plane analysis and ITE metric, improving understanding of information dynamics and feature attribution in neural networks.

Findings

01

Identifies critical layers influencing performance and interpretability

02

Reveals complex information compression and utilization patterns

03

Challenges predictions of the Information Bottleneck theory

Abstract

The Information Plane is a conceptual framework used to analyze the flow of information in neural networks, but traditional methods based on activations may not fully capture the dynamics of information processing. This paper introduces a new approach that uses layer conductance, a measure of sensitivity to input features, to enhance the Information Plane analysis. By incorporating gradient-based contributions, we provide a more precise characterization of information dynamics within the network. The proposed conductance-based Information Plane and a new Information Transformation Efficiency (ITE) metric are evaluated on pretrained ResNet50 and VGG16 models using the ImageNet dataset. Our results demonstrate the ability to identify critical hidden layers that contribute significantly to model performance and interpretability, giving insights into information compression, preservation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Fault Detection and Control Systems