Explaining a probabilistic prediction on the simplex with Shapley compositions
Paul-Gauthier No\'e, Miquel Perell\'o-Nieto, Jean-Fran\c{c}ois, Bonastre, Peter Flach

TL;DR
This paper introduces Shapley compositions, a novel method for explaining multiclass probabilistic predictions on the simplex, leveraging Aitchison geometry to preserve the compositional structure of the output.
Contribution
It proposes Shapley compositions as a principled way to explain multiclass probabilistic predictions, extending Shapley values to the Aitchison simplex with proven axiomatic properties.
Findings
Shapley compositions are unique under linearity, symmetry, and efficiency.
The method effectively explains multiclass probabilistic models.
Extension of Shapley values to compositional data analysis.
Abstract
Originating in game theory, Shapley values are widely used for explaining a machine learning model's prediction by quantifying the contribution of each feature's value to the prediction. This requires a scalar prediction as in binary classification, whereas a multiclass probabilistic prediction is a discrete probability distribution, living on a multidimensional simplex. In such a multiclass setting the Shapley values are typically computed separately on each class in a one-vs-rest manner, ignoring the compositional nature of the output distribution. In this paper, we introduce Shapley compositions as a well-founded way to properly explain a multiclass probabilistic prediction, using the Aitchison geometry from compositional data analysis. We prove that the Shapley composition is the unique quantity satisfying linearity, symmetry and efficiency on the Aitchison simplex, extending the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping · Rough Sets and Fuzzy Logic · Topological and Geometric Data Analysis
