"Why did the Model Fail?": Attributing Model Performance Changes to   Distribution Shifts

Haoran Zhang; Harvineet Singh; Marzyeh Ghassemi; Shalmali Joshi

arXiv:2210.10769·cs.LG·June 7, 2023·1 cites

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method to attribute changes in machine learning model performance to specific distribution shifts in data, using a game-theoretic approach with Shapley values, validated on various datasets.

Contribution

It formulates performance attribution as a cooperative game and derives an importance weighting method to quantify each distribution's impact, a novel approach for understanding model failures.

Findings

01

Effective in attributing performance drops to distribution shifts

02

Validated on synthetic, semi-synthetic, and real-world data

03

Provides insights for diagnosing model failures

Abstract

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate distributions, or changes in the relationship between label and features. When a model does fail during deployment, attributing performance change to these factors is critical for the model developer to identify the root cause and take mitigating actions. In this work, we introduce the problem of attributing performance differences between environments to distribution shifts in the underlying data generating mechanisms. We formulate the problem as a cooperative game where the players are distributions. We define the value of a set of distributions to be the change in model performance when only this set of distributions has changed between environments, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mlforhealth/expl_perf_drop
pytorchOfficial

Videos

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts· slideslive

Taxonomy

TopicsData Stream Mining Techniques · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)