Which Explanation Should I Choose? A Function Approximation Perspective   to Characterizing Post Hoc Explanations

Tessa Han; Suraj Srinivas; Himabindu Lakkaraju

arXiv:2206.01254·cs.LG·January 2, 2023·28 cites

Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post Hoc Explanations

Tessa Han, Suraj Srinivas, Himabindu Lakkaraju

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper unifies eight popular post hoc explanation methods under a local function approximation framework, clarifying their differences, limitations, and guiding their practical selection based on faithfulness to the black-box model.

Contribution

It demonstrates that these explanation methods share a common local function approximation goal, introduces a no free lunch theorem, and offers a principled way to select suitable methods.

Findings

01

Unified explanation methods under a common framework

02

Proved a no free lunch theorem for explanation methods

03

Provided practical guidelines for method selection

Abstract

A critical problem in the field of post hoc explainability is the lack of a common foundational goal among methods. For example, some methods are motivated by function approximation, some by game theoretic notions, and some by obtaining clean visualizations. This fragmentation of goals causes not only an inconsistent conceptual understanding of explanations but also the practical challenge of not knowing which method to use when. In this work, we begin to address these challenges by unifying eight popular post hoc explanation methods (LIME, C-LIME, KernelSHAP, Occlusion, Vanilla Gradients, Gradients x Input, SmoothGrad, and Integrated Gradients). We show that these methods all perform local function approximation of the black-box model, differing only in the neighbourhood and loss function used to perform the approximation. This unification enables us to (1) state a no free lunch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AI4LIFE-GROUP/lfa
pytorchOfficial

Videos

Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post Hoc Explanations· slideslive

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Advanced Bandit Algorithms Research

MethodsShapley Additive Explanations · High-Order Consensuses