Using Shapley Values and Variational Autoencoders to Explain Predictive   Models with Dependent Mixed Features

Lars Henry Berge Olsen; Ingrid Kristine Glad; Martin Jullum and; Kjersti Aas

arXiv:2111.13507·stat.ML·August 16, 2022

Using Shapley Values and Variational Autoencoders to Explain Predictive Models with Dependent Mixed Features

Lars Henry Berge Olsen, Ingrid Kristine Glad, Martin Jullum and, Kjersti Aas

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method using variational autoencoders with arbitrary conditioning to improve the estimation of Shapley values for dependent mixed features, enhancing explanation accuracy in complex models.

Contribution

We propose a VAEAC-based approach for modeling feature dependencies to accurately estimate Shapley values in dependent data settings, outperforming existing methods.

Findings

01

VAEAC approach outperforms state-of-the-art methods in simulations.

02

Significant improvements in high-dimensional settings with non-uniform masking.

03

Effective application to real-world Abalone dataset demonstrates practical utility.

Abstract

Shapley values are today extensively used as a model-agnostic explanation framework to explain complex predictive machine learning models. Shapley values have desirable theoretical properties and a sound mathematical foundation in the field of cooperative game theory. Precise Shapley value estimates for dependent data rely on accurate modeling of the dependencies between all feature combinations. In this paper, we use a variational autoencoder with arbitrary conditioning (VAEAC) to model all feature dependencies simultaneously. We demonstrate through comprehensive simulation studies that our VAEAC approach to Shapley value estimation outperforms the state-of-the-art methods for a wide range of settings for both continuous and mixed dependent features. For high-dimensional settings, our VAEAC approach with a non-uniform masking scheme significantly outperforms competing methods. Finally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lhbo/shapleyvaluesvaeac
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Machine Learning and Data Classification