F-Fidelity: A Robust Framework for Faithfulness Evaluation of   Explainable AI

Xu Zheng; Farhad Shirani; Zhuomin Chen; Chaohao Lin; Wei Cheng; Wenbo; Guo; Dongsheng Luo

arXiv:2410.02970·cs.LG·March 10, 2025

F-Fidelity: A Robust Framework for Faithfulness Evaluation of Explainable AI

Xu Zheng, Farhad Shirani, Zhuomin Chen, Chaohao Lin, Wei Cheng, Wenbo, Guo, Dongsheng Luo

PDF

Open Access

TL;DR

F-Fidelity is a new evaluation framework for XAI methods that addresses OOD issues and information leakage, providing more reliable assessments across various data types.

Contribution

It introduces a robust, explanation-agnostic fine-tuning and masking approach to improve the fidelity evaluation of XAI techniques.

Findings

01

F-Fidelity outperforms prior metrics in ranking explainers accurately.

02

It can estimate the true size of influential input components.

03

The framework is validated across images, time series, and language data.

Abstract

Recent research has developed a number of eXplainable AI (XAI) techniques, such as gradient-based approaches, input perturbation-base methods, and black-box explanation methods. While these XAI techniques can extract meaningful insights from deep learning models, how to properly evaluate them remains an open problem. The most widely used approach is to perturb or even remove what the XAI method considers to be the most important features in an input and observe the changes in the output prediction. This approach, although straightforward, suffers the Out-of-Distribution (OOD) problem as the perturbed samples may no longer follow the original data distribution. A recent method RemOve And Retrain (ROAR) solves the OOD issue by retraining the model with perturbed samples guided by explanations. However, using the model retrained based on XAI methods to evaluate these explainers may cause…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Adversarial Robustness in Machine Learning