How Much Can We See? A Note on Quantifying Explainability of Machine   Learning Models

Gero Szepannek

arXiv:1910.13376·stat.ML·December 17, 2019·5 cites

How Much Can We See? A Note on Quantifying Explainability of Machine Learning Models

Gero Szepannek

PDF

Open Access

TL;DR

This paper introduces a framework to quantify the explainability of machine learning models using partial dependence plots, helping determine how well these visualizations can explain model predictions.

Contribution

It develops a novel framework to measure the extent to which PDPs can explain black box models' predictions, addressing a key question in model interpretability.

Findings

01

Framework quantifies explainability of models

02

Enables judgment on sufficiency of explanations

03

Assists in evaluating visualization effectiveness

Abstract

One of the most popular approaches to understanding feature effects of modern black box machine learning models are partial dependence plots (PDP). These plots are easy to understand but only able to visualize low order dependencies. The paper is about the question 'How much can we see?': A framework is developed to quantify the explainability of arbitrary machine learning models, i.e. up to what degree the visualization as given by a PDP is able to explain the predictions of the model. The result allows for a judgement whether an attempt to explain a black box model is sufficient or not.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning