Strengthening Interpretability: An Investigative Study of Integrated   Gradient Methods

Shree Singhi; Anupriya Kumari

arXiv:2409.09043·math.NA·September 17, 2024

Strengthening Interpretability: An Investigative Study of Integrated Gradient Methods

Shree Singhi, Anupriya Kumari

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the IDGI framework's effectiveness in improving Integrated Gradients explanations, analyzing its theoretical properties, numerical stability, and performance across different step counts in a comprehensive reproducibility study.

Contribution

It provides the first rigorous analysis and extensive experimental validation of the IDGI framework, enhancing the reliability of IG-based interpretability methods.

Findings

01

IDGI reduces explanation noise in IG methods

02

IDGI outperforms standard IG in key metrics

03

Numerical stability varies with step count

Abstract

We conducted a reproducibility study on Integrated Gradients (IG) based methods and the Important Direction Gradient Integration (IDGI) framework. IDGI eliminates the explanation noise in each step of the computation of IG-based methods that use the Riemann Integration for integrated gradient computation. We perform a rigorous theoretical analysis of IDGI and raise a few critical questions that we later address through our study. We also experimentally verify the authors' claims concerning the performance of IDGI over IG-based methods. Additionally, we varied the number of steps used in the Riemann approximation, an essential parameter in all IG methods, and analyzed the corresponding change in results. We also studied the numerical instability of the attribution methods to check the consistency of the saliency maps produced. We developed the complete code to implement IDGI over the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ShreeSinghi/RiemannOpt
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning