Fast Axiomatic Attribution for Neural Networks
Robin Hesse, Simone Schaub-Meyer, Stefan Roth

TL;DR
This paper introduces a class of neural networks called $ ext{X}$-DNNs that enable fast, axiomatic feature attribution with a single forward/backward pass, improving training with attribution priors.
Contribution
The authors propose $ ext{X}$-DNNs, a special class of nonnegatively homogeneous networks that allow efficient axiomatic attribution, overcoming the trade-off between attribution quality and computational cost.
Findings
$ ext{X}$-DNNs enable single-pass attribution computation.
$ ext{X}$-DNNs outperform state-of-the-art attribution methods.
Removing biases from regular DNNs constructs $ ext{X}$-DNNs.
Abstract
Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable axioms, against the time required to compute them. This in turn either led to long training times or ineffective attribution priors. In this work, we break this trade-off by considering a special class of efficiently axiomatically attributable DNNs for which an axiomatic feature attribution can be computed with only a single forward/backward pass. We formally prove that nonnegatively homogeneous DNNs, here termed -DNNs, are efficiently axiomatically attributable and show that they…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Advanced Graph Neural Networks
