Unifying Attribution-Based Explanations Using Functional Decomposition

Arne Gevaert; Yvan Saeys

arXiv:2412.13623·cs.LG·December 19, 2024

Unifying Attribution-Based Explanations Using Functional Decomposition

Arne Gevaert, Yvan Saeys

PDF

Open Access

TL;DR

This paper introduces a unifying framework for attribution-based explanation methods in machine learning, linking them through functional decomposition and game theory to better understand their properties and guide method selection.

Contribution

It proposes a comprehensive framework that unifies various explanation methods via canonical additive decomposition and game theory, enabling systematic analysis and development of new explanations.

Findings

01

Most existing attribution methods are special cases of removal-based attribution methods.

02

Every valid additive decomposition corresponds to a canonical additive decomposition (CAD).

03

The framework allows for the derivation of new explanation methods and their properties.

Abstract

The black box problem in machine learning has led to the introduction of an ever-increasing set of explanation methods for complex models. These explanations have different properties, which in turn has led to the problem of method selection: which explanation method is most suitable for a given use case? In this work, we propose a unifying framework of attribution-based explanation methods, which provides a step towards a rigorous study of the similarities and differences of explanations. We first introduce removal-based attribution methods (RBAMs), and show that an extensively broad selection of existing methods can be viewed as such RBAMs. We then introduce the canonical additive decomposition (CAD). This is a general construction for additively decomposing any function based on the central idea of removing (groups of) features. We proceed to show that indeed every valid additive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)

MethodsSparse Evolutionary Training