Explaining Machine Learning Models using Entropic Variable Projection

Fran\c{c}ois Bachoc (IMT); Fabrice Gamboa (IMT); Max Halford (IMT,; IRIT); Jean-Michel Loubes (IMT); Laurent Risser (IMT)

arXiv:1810.07924·stat.ML·August 12, 2022

Explaining Machine Learning Models using Entropic Variable Projection

Fran\c{c}ois Bachoc (IMT), Fabrice Gamboa (IMT), Max Halford (IMT,, IRIT), Jean-Michel Loubes (IMT), Laurent Risser (IMT)

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel, model-agnostic explainability framework using entropic projections to interpret how input variables influence machine learning model predictions, applicable to various models and datasets.

Contribution

It presents the first unified formalism based on information theory for explaining input variable impacts on predictions, scalable to large datasets and different model types.

Findings

01

Framework is model-agnostic and scalable.

02

Provides convergence rates for entropic projections.

03

Demonstrates effectiveness on diverse datasets and models.

Abstract

In this paper, we present a new explainability formalism designed to shed light on how each input variable of a test set impacts the predictions of machine learning models. Hence, we propose a group explainability formalism for trained machine learning decision rules, based on their response to the variability of the input variables distribution. In order to emphasize the impact of each input variable, this formalism uses an information theory framework that quantifies the influence of all input-output observations based on entropic projections. This is thus the first unified and model agnostic formalism enabling data scientists to interpret the dependence between the input variables, their impact on the prediction errors, and their influence on the output predictions. Convergence rates of the entropic projections are provided in the large sample case. Most importantly, we prove that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Machine Learning in Healthcare

MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations