A General Taylor Framework for Unifying and Revisiting Attribution   Methods

Huiqi Deng; Na Zou; Mengnan Du; Weifu Chen; Guocan Feng; Xia Hu

arXiv:2105.13841·cs.LG·February 28, 2023·1 cites

A General Taylor Framework for Unifying and Revisiting Attribution Methods

Huiqi Deng, Na Zou, Mengnan Du, Weifu Chen, Guocan Feng, Xia Hu

PDF

Open Access

TL;DR

This paper introduces a Taylor-based theoretical framework to unify and analyze various attribution methods for deep learning models, providing insights into their principles, limitations, and performance.

Contribution

It proposes a general Taylor attribution framework that unifies 14 attribution methods and establishes principles for effective attribution, bridging theoretical understanding and practical evaluation.

Findings

01

Reformulates 14 attribution methods within the Taylor framework

02

Identifies three key principles for good attribution methods

03

Shows a positive correlation between principles followed and attribution performance

Abstract

Attribution methods provide an insight into the decision-making process of machine learning models, especially deep neural networks, by assigning contribution scores to each individual feature. However, the attribution problem has not been well-defined, which lacks a unified guideline to the contribution assignment process. Furthermore, existing attribution methods often built upon various empirical intuitions and heuristics. There still lacks a general theoretical framework that not only can offer a good description of the attribution problem, but also can be applied to unifying and revisiting existing attribution methods. To bridge the gap, in this paper, we propose a Taylor attribution framework, which models the attribution problem as how to decide individual payoffs in a coalition. Then, we reformulate fourteen mainstream attribution methods into the Taylor framework and analyze…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference