Obtaining Example-Based Explanations from Deep Neural Networks

Genghua Dong; Henrik Bostr\"om; Michalis Vazirgiannis; Roman Bresson

arXiv:2502.19768·cs.LG·February 28, 2025

Obtaining Example-Based Explanations from Deep Neural Networks

Genghua Dong, Henrik Bostr\"om, Michalis Vazirgiannis, Roman Bresson

PDF

Open Access

TL;DR

This paper introduces EBE-DNN, a method for deriving example-based explanations from deep neural networks by leveraging embeddings and k-nearest neighbors, providing concise, accurate explanations that complement feature attribution.

Contribution

The work presents a novel technique to obtain example-based explanations from deep neural networks using embeddings and k-NN, applicable to complex models beyond traditional methods.

Findings

01

EBE-DNN provides concentrated example attributions.

02

Predictions remain accurate with few training examples.

03

Embedding layer choice significantly affects accuracy.

Abstract

Most techniques for explainable machine learning focus on feature attribution, i.e., values are assigned to the features such that their sum equals the prediction. Example attribution is another form of explanation that assigns weights to the training examples, such that their scalar product with the labels equals the prediction. The latter may provide valuable complementary information to feature attribution, in particular in cases where the features are not easily interpretable. Current example-based explanation techniques have targeted a few model types only, such as k-nearest neighbors and random forests. In this work, a technique for obtaining example-based explanations from deep neural networks (EBE-DNN) is proposed. The basic idea is to use the deep neural network to obtain an embedding, which is employed by a k-nearest neighbor classifier to form a prediction; the example…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning

MethodsFocus