DeepFaith: A Domain-Free and Model-Agnostic Unified Framework for Highly Faithful Explanations

Yuhan Guo; Lizhong Ding; Shihan Jia; Yanyu Ren; Pengqi Li; Jiarun Fu; Changsheng Li; Ye yuan; Guoren Wang

arXiv:2508.03586·cs.LG·August 6, 2025

DeepFaith: A Domain-Free and Model-Agnostic Unified Framework for Highly Faithful Explanations

Yuhan Guo, Lizhong Ding, Shihan Jia, Yanyu Ren, Pengqi Li, Jiarun Fu, Changsheng Li, Ye yuan, Guoren Wang

PDF

TL;DR

DeepFaith introduces a unified, model-agnostic framework that optimizes faithfulness across multiple metrics, providing highly faithful explanations without model access, and demonstrating superior performance across diverse tasks.

Contribution

It proposes a theoretically grounded, domain-free, and model-agnostic explanation framework that unifies multiple faithfulness metrics for optimal explanation quality.

Findings

01

Achieves highest faithfulness across 10 metrics on 12 tasks

02

Operates without model access after training

03

Demonstrates cross-domain generalizability

Abstract

Explainable AI (XAI) builds trust in complex systems through model attribution methods that reveal the decision rationale. However, due to the absence of a unified optimal explanation, existing XAI methods lack a ground truth for objective evaluation and optimization. To address this issue, we propose Deep architecture-based Faith explainer (DeepFaith), a domain-free and model-agnostic unified explanation framework under the lens of faithfulness. By establishing a unified formulation for multiple widely used and well-validated faithfulness metrics, we derive an optimal explanation objective whose solution simultaneously achieves optimal faithfulness across these metrics, thereby providing a ground truth from a theoretical perspective. We design an explainer learning framework that leverages multiple existing explanation methods, applies deduplicating and filtering to construct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.