Developing Guidelines for Functionally-Grounded Evaluation of Explainable Artificial Intelligence using Tabular Data
Mythreyi Velmurugan, Chun Ouyang, Yue Xu, Renuka Sindhgatta, and Bemali Wickramanayake, Catarina Moreira

TL;DR
This paper develops guidelines for evaluating explainable AI techniques on tabular data, addressing the lack of standardized methods and identifying key evaluation criteria and research gaps.
Contribution
It systematically reviews XAI evaluation literature to propose functionally-grounded assessment guidelines specifically for tabular data applications.
Findings
Identified 20 evaluation criteria for XAI on tabular data
Provided guidelines on applying each evaluation criterion
Highlighted key research gaps in XAI evaluation
Abstract
Explainable Artificial Intelligence (XAI) techniques are used to provide transparency to complex, opaque predictive models. However, these techniques are often designed for image and text data, and it is unclear how fit-for-purpose they are when applied to tabular data. As XAI techniques are rarely evaluated in settings with tabular data, the applicability of existing evaluation criteria and methods are also unclear and needs (re-)examination. For example, some works suggest that evaluation methods may unduly influence the evaluation results when using tabular data. This lack of clarity on evaluation procedures can lead to reduced transparency and ineffective use of XAI techniques in real world settings. In this study, we examine literature on XAI evaluation to derive guidelines on functionally-grounded assessment of local, post hoc XAI techniques. We identify 20 evaluation criteria and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Big Data Technologies and Applications
MethodsHigh-Order Consensuses
