Transferable XAI: Relating Understanding Across Domains with Explanation Transfer
Fei Wang, Yifan Zhang, Brian Y. Lim

TL;DR
This paper introduces Transferable XAI, a framework that enables users to transfer explanations across related domains by modeling relationships between explanations, improving understanding and decision faithfulness in multi-domain AI applications.
Contribution
It proposes a novel affine transformation framework for explanation transfer across domains, supporting subspace, task, and attribute relationships, validated through user studies.
Findings
Transferable XAI improved understanding of related domains.
Participants showed better decision faithfulness with Transferable XAI.
The framework effectively models relationships between domain explanations.
Abstract
Current Explainable AI (XAI) focuses on explaining a single application, but when encountering related applications, users may rely on their prior understanding from previous explanations. This leads to either overgeneralization and AI overreliance, or burdensome independent memorization. Indeed, related decision tasks can share explanatory factors, but with some notable differences; e.g., body mass index (BMI) affects the risks for heart disease and diabetes at the same rate, but chest pain is more indicative of heart disease. Similarly, models using different attributes for the same task still share signals; e.g., temperature and pressure affect air pollution but in opposite directions due to the ideal gas law. Leveraging transfer of learning, we propose Transferable XAI to enable users to transfer understanding across related domains by explaining the relationship between domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Multimodal Machine Learning Applications
