Select Wisely and Explain: Active Learning and Probabilistic Local   Post-hoc Explainability

Aditya Saini; Ranjitha Prasad

arXiv:2108.06907·cs.LG·April 25, 2022·1 cites

Select Wisely and Explain: Active Learning and Probabilistic Local Post-hoc Explainability

Aditya Saini, Ranjitha Prasad

PDF

Open Access 1 Repo

TL;DR

This paper introduces UnRAvEL, an active learning method that improves local explanations of black-box models by using uncertainty-driven sampling with Gaussian process regression, enhancing stability and fidelity.

Contribution

The paper proposes UnRAvEL, a novel active learning approach for generating reliable local explanations, with theoretical analysis and demonstrated effectiveness on real-world datasets.

Findings

01

UnRAvEL outperforms baselines in stability and local fidelity.

02

UnRAvEL effectively generates surrogate datasets for explanation.

03

Demonstrated sample efficiency on ImageNet with ResNet.

Abstract

Albeit the tremendous performance improvements in designing complex artificial intelligence (AI) systems in data-intensive domains, the black-box nature of these systems leads to the lack of trustworthiness. Post-hoc interpretability methods explain the prediction of a black-box ML model for a single instance, and such explanations are being leveraged by domain experts to diagnose the underlying biases of these models. Despite their efficacy in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations. In this paper, we propose an active learning-based technique called UnRAvEL (Uncertainty driven Robust Active Learning Based Locally Faithful Explanations), which consists of a novel acquisition function that is locally faithful and uses uncertainty-driven sampling based on the posterior distribution on the probabilistic locality using Gaussian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adityasaini70/unravel
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare

Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Residual Connection · Bottleneck Residual Block · Convolution · Residual Block · Average Pooling · Max Pooling · Kaiming Initialization