Precise Benchmarking of Explainable AI Attribution Methods
Rafa\"el Brandt, Daan Raatjens, Georgi Gaydadjiev

TL;DR
This paper introduces a new benchmarking approach for explainable AI attribution methods using synthetic models with ground truth explanations, enabling precise and unbiased evaluation of explanation quality.
Contribution
The authors propose a novel evaluation framework with high-fidelity metrics and synthetic models, providing unbiased, detailed insights into XAI attribution methods' performance.
Findings
Guided-Backprop and Smoothgrad have high positive contribution scores.
Both methods show poor precision for negative contributions.
The proposed metrics are computationally efficient.
Abstract
The rationale behind a deep learning model's output is often difficult to understand by humans. EXplainable AI (XAI) aims at solving this by developing methods that improve interpretability and explainability of machine learning models. Reliable evaluation metrics are needed to assess and compare different XAI methods. We propose a novel evaluation approach for benchmarking state-of-the-art XAI attribution methods. Our proposal consists of a synthetic classification model accompanied by its derived ground truth explanations allowing high precision representation of input nodes contributions. We also propose new high-fidelity metrics to quantify the difference between explanations of the investigated XAI method and those derived from the synthetic model. Our metrics allow assessment of explanations in terms of precision and recall separately. Also, we propose metrics to independently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
