Anytime Approximate Formal Feature Attribution
Jinqiang Yu, Graham Farr, Alexey Ignatiev, Peter J. Stuckey

TL;DR
This paper addresses the challenge of explainability in AI by analyzing formal feature attribution (FFA), proving its computational intractability, and proposing an efficient heuristic algorithm for approximating FFA in an anytime manner, validated through experiments.
Contribution
It proves FFA computation is #P-hard, and introduces an adaptive heuristic algorithm for efficient FFA approximation with experimental validation.
Findings
FFA computation is #P-hard even with contrastive explanations
The proposed heuristic effectively approximates FFA within fixed time limits
Experimental results show accurate FFA approximation and diverse explanations
Abstract
Widespread use of artificial intelligence (AI) algorithms and machine learning (ML) models on the one hand and a number of crucial issues pertaining to them warrant the need for explainable artificial intelligence (XAI). A key explainability question is: given this decision was made, what are the input features which contributed to the decision? Although a range of XAI approaches exist to tackle this problem, most of them have significant limitations. Heuristic XAI approaches suffer from the lack of quality guarantees, and often try to approximate Shapley values, which is not the same as explaining which features contribute to a decision. A recent alternative is so-called formal feature attribution (FFA), which defines feature importance as the fraction of formal abductive explanations (AXp's) containing the given feature. This measures feature importance from the view of formally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Topic Modeling
MethodsSparse Evolutionary Training
