Developing a Fidelity Evaluation Approach for Interpretable Machine Learning
Mythreyi Velmurugan, Chun Ouyang, Catarina Moreira, Renuka, Sindhgatta

TL;DR
This paper proposes a new three-phase approach to evaluate the fidelity of explainable AI methods, adapting existing evaluation techniques for tabular data and assessing popular methods' effectiveness.
Contribution
It introduces a structured evaluation framework for explanation fidelity in tabular data, filling a gap in current XAI assessment methods.
Findings
Explanation fidelity varies with model and data complexity.
Internal model mechanisms influence explanation accuracy.
No single explainable method is universally superior.
Abstract
Although modern machine learning and deep learning methods allow for complex and in-depth data analytics, the predictive models generated by these methods are often highly complex, and lack transparency. Explainable AI (XAI) methods are used to improve the interpretability of these complex models, and in doing so improve transparency. However, the inherent fitness of these explainable methods can be hard to evaluate. In particular, methods to evaluate the fidelity of the explanation to the underlying black box require further development, especially for tabular data. In this paper, we (a) propose a three phase approach to developing an evaluation method; (b) adapt an existing evaluation method primarily for image and text data to evaluate models trained on tabular data; and (c) evaluate two popular explainable methods using this evaluation method. Our evaluations suggest that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
