Efficient Explanations from Empirical Explainers

Robert Schwarzenberg; Nils Feldhus; Sebastian Möller

arXiv:2103.15429 · cs.LG · September 16, 2021

TL;DR

This paper introduces Empirical Explainers, models that learn from data to predict the attribution maps of computationally expensive explainers. In the language domain they approximate their expensive counterparts well at a fraction of the cost, which makes them suitable for applications that tolerate an approximation error.

Contribution

The paper proposes feature attribution modelling: an Empirical Explainer is trained to predict the attribution maps that an expensive explainer would produce, which reduces the computational burden of neural explanations.

Findings

01. Empirical Explainers model expensive explainers well in language tasks.

02. They approximate the expensive explainers' attribution maps at a fraction of the computational cost.

03. The approach suits applications that tolerate an approximation error.

Abstract

Amid a discussion about Green AI in which we see explainability neglected, we explore the possibility to efficiently approximate computationally expensive explainers. To this end, we propose feature attribution modelling with Empirical Explainers. Empirical Explainers learn from data to predict the attribution maps of expensive explainers. We train and test Empirical Explainers in the language domain and find that they model their expensive counterparts surprisingly well, at a fraction of the cost. They could thus mitigate the computational burden of neural explanations significantly, in applications that tolerate an approximation error.
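The idea of learning to predict attribution maps can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy model `f(x) = tanh(w . x)`, the hypothetical weights `W`, the use of Integrated Gradients with a zero baseline as the expensive explainer, and a plain least-squares linear map standing in for the Empirical Explainer, which in practice would more plausibly be a neural network.

```python
import numpy as np

rng = np.random.default_rng(0)
W = np.array([0.5, -0.3, 0.8, 0.1, -0.4])  # hypothetical toy model weights

def model(X):
    """Toy stand-in for a trained network: f(x) = tanh(w . x)."""
    return np.tanh(X @ W)

def model_grad(X):
    """Analytic gradient of f with respect to each input feature."""
    return (1.0 - np.tanh(X @ W) ** 2)[:, None] * W

def expensive_explainer(X, steps=64):
    """Integrated Gradients with a zero baseline (midpoint Riemann sum).

    Needs `steps` gradient evaluations per input, which is what makes it costly.
    """
    alphas = (np.arange(steps) + 0.5) / steps
    avg_grad = np.mean([model_grad(a * X) for a in alphas], axis=0)
    return X * avg_grad

# Collect (input, attribution map) pairs from the expensive explainer.
X_train = rng.normal(size=(2000, 5))
X_test = rng.normal(size=(500, 5))
A_train = expensive_explainer(X_train)
A_test = expensive_explainer(X_test)

def add_bias(X):
    """Append a constant column so the linear map can learn an offset."""
    return np.hstack([X, np.ones((len(X), 1))])

# "Empirical explainer" (assumed form): least-squares map from inputs to attributions.
coef, *_ = np.linalg.lstsq(add_bias(X_train), A_train, rcond=None)

def empirical_explainer(X):
    """Predict attribution maps with one matrix product, no gradient calls."""
    return add_bias(X) @ coef

A_pred = empirical_explainer(X_test)
mse_empirical = np.mean((A_pred - A_test) ** 2)  # error of the learned explainer
mse_zero = np.mean(A_test ** 2)                  # error of always predicting zero
print(f"empirical MSE: {mse_empirical:.5f}, zero-baseline MSE: {mse_zero:.5f}")
```

In this sketch the expensive explainer needs 64 gradient evaluations per input, while the learned explainer needs a single matrix product. The price is an approximation error, measured here on held-out inputs against the expensive explainer's own output.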
