IDGI: A Framework to Eliminate Explanation Noise from Integrated   Gradients

Ruo Yang; Binghui Wang; Mustafa Bilgic

arXiv:2303.14242·cs.CV·March 28, 2023·1 cites

IDGI: A Framework to Eliminate Explanation Noise from Integrated Gradients

Ruo Yang, Binghui Wang, Mustafa Bilgic

PDF

Open Access 1 Repo

TL;DR

IDGI is a new framework that significantly reduces explanation noise in Integrated Gradients, enhancing interpretability of deep neural network decisions by analytically identifying and eliminating noise sources.

Contribution

The paper introduces IDGI, a novel framework that can be integrated into existing IG methods to minimize explanation noise based on analytical insights.

Findings

01

IDGI improves interpretability metrics across multiple IG-based methods.

02

Extensive experiments demonstrate significant noise reduction.

03

IDGI enhances the clarity of neural network explanations.

Abstract

Integrated Gradients (IG) as well as its variants are well-known techniques for interpreting the decisions of deep neural networks. While IG-based approaches attain state-of-the-art performance, they often integrate noise into their explanation saliency maps, which reduce their interpretability. To minimize the noise, we examine the source of the noise analytically and propose a new approach to reduce the explanation noise based on our analytical findings. We propose the Important Direction Gradient Integration (IDGI) framework, which can be easily incorporated into any IG-based method that uses the Reimann Integration for integrated gradient computation. Extensive experiments with three IG-based methods show that IDGI improves them drastically on numerous interpretability metrics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yangruo1226/idgi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications