Analogies and Feature Attributions for Model Agnostic Explanation of   Similarity Learners

Karthikeyan Natesan Ramamurthy; Amit Dhurandhar; Dennis Wei; Zaid Bin; Tariq

arXiv:2202.01153·cs.LG·February 3, 2022

Analogies and Feature Attributions for Model Agnostic Explanation of Similarity Learners

Karthikeyan Natesan Ramamurthy, Amit Dhurandhar, Dennis Wei, Zaid Bin, Tariq

PDF

Open Access

TL;DR

This paper introduces model-agnostic local explanations for similarity learners, proposing feature attributions and analogies to interpret model predictions for tabular and text data, with efficient search and practical applications.

Contribution

It presents a novel analogy-based explanation method and connects it with feature attributions, providing efficient, model-agnostic interpretability for similarity models.

Findings

01

Analogies effectively explain similarity predictions.

02

Proposed methods are applicable to text and healthcare data.

03

Analyses show improved interpretability and user understanding.

Abstract

Post-hoc explanations for black box models have been studied extensively in classification and regression settings. However, explanations for models that output similarity between two inputs have received comparatively lesser attention. In this paper, we provide model agnostic local explanations for similarity learners applicable to tabular and text data. We first propose a method that provides feature attributions to explain the similarity between a pair of inputs as determined by a black box similarity learner. We then propose analogies as a new form of explanation in machine learning. Here the goal is to identify diverse analogous pairs of examples that share the same level of similarity as the input pair and provide insight into (latent) factors underlying the model's prediction. The selection of analogies can optionally leverage feature attributions, thus connecting the two forms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Topic Modeling