Enhancing Model Interpretability with Local Attribution over Global   Exploration

Zhiyu Zhu; Zhibo Jin; Jiayu Zhang; Huaming Chen

arXiv:2408.07736·cs.LG·August 16, 2024

Enhancing Model Interpretability with Local Attribution over Global Exploration

Zhiyu Zhu, Zhibo Jin, Jiayu Zhang, Huaming Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Local Attribution (LA) algorithm that improves model interpretability by focusing on local space exploration, reducing the impact of out-of-distribution states, and outperforming existing attribution methods.

Contribution

The paper proposes a novel LA algorithm that leverages local space properties with targeted and untargeted exploration phases for better attribution accuracy.

Findings

01

Achieves 38.21% improvement in attribution effectiveness over state-of-the-art methods.

02

Effectively generates intermediate states within local space, avoiding OOD issues.

03

Extensive ablation studies confirm the importance of each component in the LA algorithm.

Abstract

In the field of artificial intelligence, AI models are frequently described as `black boxes' due to the obscurity of their internal mechanisms. It has ignited research interest on model interpretability, especially in attribution methods that offers precise explanations of model decisions. Current attribution algorithms typically evaluate the importance of each parameter by exploring the sample space. A large number of intermediate states are introduced during the exploration process, which may reach the model's Out-of-Distribution (OOD) space. Such intermediate states will impact the attribution results, making it challenging to grasp the relative importance of features. In this paper, we firstly define the local space and its relevant properties, and we propose the Local Attribution (LA) algorithm that leverages these properties. The LA algorithm comprises both targeted and untargeted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lmbtough/la
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Processing Techniques · Reservoir Engineering and Simulation Methods · Topic Modeling