Reliable Explanations or Random Noise? A Reliability Metric for XAI
Poushali Sengupta, Sabita Maharjan, Frank Eliassen, Shashi Raj Pandey, Yan Zhang

TL;DR
This paper introduces the Explanation Reliability Index (ERI), a set of metrics to evaluate the stability of explanations in XAI, revealing widespread unreliability in popular methods like SHAP and IG under realistic conditions.
Contribution
The paper proposes ERI, a novel framework with formal guarantees and benchmarks for assessing explanation stability in XAI, addressing a critical gap in explanation reliability measurement.
Findings
Popular explanation methods often produce unstable explanations.
ERI metrics reveal significant reliability issues in current XAI techniques.
The benchmark enables systematic testing of explanation robustness.
Abstract
In recent years, explaining decisions made by complex machine learning models has become essential in high-stakes domains such as energy systems, healthcare, finance, and autonomous systems. However, the reliability of these explanations, namely, whether they remain stable and consistent under realistic, non-adversarial changes, remains largely unmeasured. Widely used methods such as SHAP and Integrated Gradients (IG) are well-motivated by axiomatic notions of attribution, yet their explanations can vary substantially even under system-level conditions, including small input perturbations, correlated representations, and minor model updates. Such variability undermines explanation reliability, as reliable explanations should remain consistent across equivalent input representations and small, performance-preserving model changes. We introduce the Explanation Reliability Index (ERI), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks
