Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post Hoc Explanations
Tessa Han, Suraj Srinivas, Himabindu Lakkaraju

TL;DR
This paper unifies eight popular post hoc explanation methods under a local function approximation framework, clarifying their differences, limitations, and guiding their practical selection based on faithfulness to the black-box model.
Contribution
It demonstrates that these explanation methods share a common local function approximation goal, introduces a no free lunch theorem, and offers a principled way to select suitable methods.
Findings
Unified explanation methods under a common framework
Proved a no free lunch theorem for explanation methods
Provided practical guidelines for method selection
Abstract
A critical problem in the field of post hoc explainability is the lack of a common foundational goal among methods. For example, some methods are motivated by function approximation, some by game theoretic notions, and some by obtaining clean visualizations. This fragmentation of goals causes not only an inconsistent conceptual understanding of explanations but also the practical challenge of not knowing which method to use when. In this work, we begin to address these challenges by unifying eight popular post hoc explanation methods (LIME, C-LIME, KernelSHAP, Occlusion, Vanilla Gradients, Gradients x Input, SmoothGrad, and Integrated Gradients). We show that these methods all perform local function approximation of the black-box model, differing only in the neighbourhood and loss function used to perform the approximation. This unification enables us to (1) state a no free lunch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Advanced Bandit Algorithms Research
MethodsShapley Additive Explanations · High-Order Consensuses
