DARE: Towards Robust Text Explanations in Biomedical and Healthcare   Applications

Adam Ivankay; Mattia Rigotti; Pascal Frossard

arXiv:2307.02094·cs.LG·July 6, 2023

DARE: Towards Robust Text Explanations in Biomedical and Healthcare Applications

Adam Ivankay, Mattia Rigotti, Pascal Frossard

PDF

Open Access 1 Repo

TL;DR

This paper introduces DARE, a domain-adaptive robustness estimator for explanations in biomedical AI, and proposes training methods to improve the robustness and plausibility of model explanations in high-stakes biomedical applications.

Contribution

We develop DARE, a domain-specific attribution robustness estimator, and propose adversarial and FAR training methods to enhance explanation robustness in biomedical deep learning models.

Findings

01

DARE effectively characterizes explanation robustness in biomedical datasets.

02

Proposed training methods improve attribution faithfulness and plausibility.

03

Extensive experiments validate the effectiveness of our approaches.

Abstract

Along with the successful deployment of deep neural networks in several application domains, the need to unravel the black-box nature of these networks has seen a significant increase recently. Several methods have been introduced to provide insight into the inference process of deep neural networks. However, most of these explainability methods have been shown to be brittle in the face of adversarial perturbations of their inputs in the image and generic textual domain. In this work we show that this phenomenon extends to specific and important high stakes domains like biomedical datasets. In particular, we observe that the robustness of explanations should be characterized in terms of the accuracy of the explanation in linking a model's inputs and its decisions - faithfulness - and its relevance from the perspective of domain experts - plausibility. This is crucial to prevent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ibm/domain-adaptive-attribution-robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning