Explaining Machine Learning Models using Entropic Variable Projection
Fran\c{c}ois Bachoc (IMT), Fabrice Gamboa (IMT), Max Halford (IMT,, IRIT), Jean-Michel Loubes (IMT), Laurent Risser (IMT)

TL;DR
This paper introduces a novel, model-agnostic explainability framework using entropic projections to interpret how input variables influence machine learning model predictions, applicable to various models and datasets.
Contribution
It presents the first unified formalism based on information theory for explaining input variable impacts on predictions, scalable to large datasets and different model types.
Findings
Framework is model-agnostic and scalable.
Provides convergence rates for entropic projections.
Demonstrates effectiveness on diverse datasets and models.
Abstract
In this paper, we present a new explainability formalism designed to shed light on how each input variable of a test set impacts the predictions of machine learning models. Hence, we propose a group explainability formalism for trained machine learning decision rules, based on their response to the variability of the input variables distribution. In order to emphasize the impact of each input variable, this formalism uses an information theory framework that quantifies the influence of all input-output observations based on entropic projections. This is thus the first unified and model agnostic formalism enabling data scientists to interpret the dependence between the input variables, their impact on the prediction errors, and their influence on the output predictions. Convergence rates of the entropic projections are provided in the large sample case. Most importantly, we prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Machine Learning in Healthcare
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
