A Dual-Perspective Approach to Evaluating Feature Attribution Methods

Yawei Li; Yang Zhang; Kenji Kawaguchi; Ashkan Khakzar; Bernd Bischl,; Mina Rezaei

arXiv:2308.08949·cs.LG·November 26, 2024

A Dual-Perspective Approach to Evaluating Feature Attribution Methods

Yawei Li, Yang Zhang, Kenji Kawaguchi, Ashkan Khakzar, Bernd Bischl,, Mina Rezaei

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dual-perspective framework for evaluating feature attribution methods in neural networks, focusing on faithfulness, soundness, and completeness to improve assessment accuracy.

Contribution

It proposes two new, mathematically grounded perspectives—soundness and completeness—for evaluating the quality of feature attributions.

Findings

01

New metrics for soundness and completeness introduced

02

Applied metrics to mainstream attribution methods

03

Enhanced understanding of attribution method effectiveness

Abstract

Feature attribution methods attempt to explain neural network predictions by identifying relevant features. However, establishing a cohesive framework for assessing feature attribution remains a challenge. There are several views through which we can evaluate attributions. One principal lens is to observe the effect of perturbing attributed features on the model's behavior (i.e., faithfulness). While providing useful insights, existing faithfulness evaluations suffer from shortcomings that we reveal in this paper. In this work, we propose two new perspectives within the faithfulness paradigm that reveal intuitive properties: soundness and completeness. Soundness assesses the degree to which attributed features are truly predictive features, while completeness examines how well the resulting attribution reveals all the predictive features. The two perspectives are based on a firm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sandylaker/soco
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Adversarial Robustness in Machine Learning