On Formal Feature Attribution and Its Approximation
Jinqiang Yu, Alexey Ignatiev, Peter J. Stuckey

TL;DR
This paper introduces a formal approach to feature attribution in AI models, proposing an efficient approximation method that improves interpretability and addresses limitations of existing techniques.
Contribution
It develops a formal feature attribution framework and an efficient approximation method, enhancing scalability and practical applicability over prior formal and non-formal methods.
Findings
Approximate FFA outperforms existing algorithms in feature importance accuracy.
The method provides more consistent feature attribution rankings.
Experimental results demonstrate improved interpretability and efficiency.
Abstract
Recent years have witnessed the widespread use of artificial intelligence (AI) algorithms and machine learning (ML) models. Despite their tremendous success, a number of vital problems like ML model brittleness, their fairness, and the lack of interpretability warrant the need for the active developments in explainable artificial intelligence (XAI) and formal ML model verification. The two major lines of work in XAI include feature selection methods, e.g. Anchors, and feature attribution techniques, e.g. LIME and SHAP. Despite their promise, most of the existing feature selection and attribution approaches are susceptible to a range of critical issues, including explanation unsoundness and out-of-distribution sampling. A recent formal approach to XAI (FXAI) although serving as an alternative to the above and free of these issues suffers from a few other limitations. For instance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Rough Sets and Fuzzy Logic
MethodsFeature Selection · Local Interpretable Model-Agnostic Explanations · Shapley Additive Explanations
