Multivariate outlier explanations using Shapley values and Mahalanobis   distances

Marcus Mayrhofer; Peter Filzmoser

arXiv:2210.10063·stat.ME·March 17, 2025

Multivariate outlier explanations using Shapley values and Mahalanobis distances

Marcus Mayrhofer, Peter Filzmoser

PDF

Open Access

TL;DR

This paper introduces a method combining Shapley values and Mahalanobis distances to explain multivariate outliers, enabling efficient and interpretable identification of variable contributions to outlyingness.

Contribution

It presents a novel approach to decompose Mahalanobis distances into variable contributions using Shapley values, enhancing outlier explanation in multivariate data.

Findings

01

Shapley values can decompose Mahalanobis distances efficiently.

02

The method aids in explaining cellwise outlyingness.

03

Simulations and real data demonstrate practical usefulness.

Abstract

For the purpose of explaining multivariate outlyingness, it is shown that the squared Mahalanobis distance of an observation can be decomposed into outlyingness contributions originating from single variables. The decomposition is obtained using the Shapley value, a well-known concept from game theory that became popular in the context of Explainable AI. In addition to outlier explanation, this concept also relates to the recent formulation of cellwise outlyingness, where Shapley values can be employed to obtain variable contributions for outlying observations with respect to their "expected" position given the multivariate data structure. In combination with squared Mahalanobis distances, Shapley values can be calculated at a low numerical cost, making them even more attractive for outlier interpretation. Simulations and real-world data examples demonstrate the usefulness of these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Forecasting Techniques and Applications · Multi-Criteria Decision Making