EvaluateXAI: A Framework to Evaluate the Reliability and Consistency of Rule-based XAI Techniques for Software Analytics Tasks
Md Abdul Awal, Chanchal K. Roy

TL;DR
This paper introduces EvaluateXAI, a framework to assess the reliability and consistency of rule-based XAI techniques like PyExplainer and LIME in explaining ML models for software analytics, revealing significant inconsistencies.
Contribution
The paper proposes a novel evaluation framework and metrics to automatically assess rule-based XAI techniques' effectiveness in software analytics tasks, highlighting their unreliability.
Findings
XAI explanations are often inconsistent and unreliable.
None of the evaluation metrics reached 100% effectiveness.
PyExplainer and LIME failed to provide consistent explanations in most cases.
Abstract
The advancement of machine learning (ML) models has led to the development of ML-based approaches to improve numerous software engineering tasks in software maintenance and evolution. Nevertheless, research indicates that despite their potential successes, ML models may not be employed in real-world scenarios because they often remain a black box to practitioners, lacking explainability in their reasoning. Recently, various rule-based model-agnostic Explainable AI (XAI) techniques, such as PyExplainer and LIME, have been employed to explain the predictions of ML models in software analytics tasks. This paper assesses the ability of these techniques (e.g., PyExplainer and LIME) to generate reliable and consistent explanations for ML models across various software analytics tasks, including Just-in-Time (JIT) defect prediction, clone detection, and the classification of useful code review…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software System Performance and Reliability · Business Process Modeling and Analysis
