Select Wisely and Explain: Active Learning and Probabilistic Local Post-hoc Explainability
Aditya Saini, Ranjitha Prasad

TL;DR
This paper introduces UnRAvEL, an active learning method that improves local explanations of black-box models by using uncertainty-driven sampling with Gaussian process regression, enhancing stability and fidelity.
Contribution
The paper proposes UnRAvEL, a novel active learning approach for generating reliable local explanations, with theoretical analysis and demonstrated effectiveness on real-world datasets.
Findings
UnRAvEL outperforms baselines in stability and local fidelity.
UnRAvEL effectively generates surrogate datasets for explanation.
Demonstrated sample efficiency on ImageNet with ResNet.
Abstract
Albeit the tremendous performance improvements in designing complex artificial intelligence (AI) systems in data-intensive domains, the black-box nature of these systems leads to the lack of trustworthiness. Post-hoc interpretability methods explain the prediction of a black-box ML model for a single instance, and such explanations are being leveraged by domain experts to diagnose the underlying biases of these models. Despite their efficacy in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations. In this paper, we propose an active learning-based technique called UnRAvEL (Uncertainty driven Robust Active Learning Based Locally Faithful Explanations), which consists of a novel acquisition function that is locally faithful and uses uncertainty-driven sampling based on the posterior distribution on the probabilistic locality using Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Residual Connection · Bottleneck Residual Block · Convolution · Residual Block · Average Pooling · Max Pooling · Kaiming Initialization
