DARE: Towards Robust Text Explanations in Biomedical and Healthcare Applications
Adam Ivankay, Mattia Rigotti, Pascal Frossard

TL;DR
This paper introduces DARE, a domain-adaptive robustness estimator for explanations in biomedical AI, and proposes training methods to improve the robustness and plausibility of model explanations in high-stakes biomedical applications.
Contribution
We develop DARE, a domain-specific attribution robustness estimator, and propose adversarial and FAR training methods to enhance explanation robustness in biomedical deep learning models.
Findings
DARE effectively characterizes explanation robustness in biomedical datasets.
Proposed training methods improve attribution faithfulness and plausibility.
Extensive experiments validate the effectiveness of our approaches.
Abstract
Along with the successful deployment of deep neural networks in several application domains, the need to unravel the black-box nature of these networks has seen a significant increase recently. Several methods have been introduced to provide insight into the inference process of deep neural networks. However, most of these explainability methods have been shown to be brittle in the face of adversarial perturbations of their inputs in the image and generic textual domain. In this work we show that this phenomenon extends to specific and important high stakes domains like biomedical datasets. In particular, we observe that the robustness of explanations should be characterized in terms of the accuracy of the explanation in linking a model's inputs and its decisions - faithfulness - and its relevance from the perspective of domain experts - plausibility. This is crucial to prevent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
