Multivariate outlier explanations using Shapley values and Mahalanobis distances
Marcus Mayrhofer, Peter Filzmoser

TL;DR
This paper introduces a method combining Shapley values and Mahalanobis distances to explain multivariate outliers, enabling efficient and interpretable identification of variable contributions to outlyingness.
Contribution
It presents a novel approach to decompose Mahalanobis distances into variable contributions using Shapley values, enhancing outlier explanation in multivariate data.
Findings
Shapley values can decompose Mahalanobis distances efficiently.
The method aids in explaining cellwise outlyingness.
Simulations and real data demonstrate practical usefulness.
Abstract
For the purpose of explaining multivariate outlyingness, it is shown that the squared Mahalanobis distance of an observation can be decomposed into outlyingness contributions originating from single variables. The decomposition is obtained using the Shapley value, a well-known concept from game theory that became popular in the context of Explainable AI. In addition to outlier explanation, this concept also relates to the recent formulation of cellwise outlyingness, where Shapley values can be employed to obtain variable contributions for outlying observations with respect to their "expected" position given the multivariate data structure. In combination with squared Mahalanobis distances, Shapley values can be calculated at a low numerical cost, making them even more attractive for outlier interpretation. Simulations and real-world data examples demonstrate the usefulness of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Forecasting Techniques and Applications · Multi-Criteria Decision Making
