Interpretable Distribution Shift Detection using Optimal Transport

Neha Hulkund; Nicolo Fusi; Jennifer Wortman Vaughan; David; Alvarez-Melis

arXiv:2208.02896·cs.LG·August 8, 2022

Interpretable Distribution Shift Detection using Optimal Transport

Neha Hulkund, Nicolo Fusi, Jennifer Wortman Vaughan, David, Alvarez-Melis

PDF

Open Access

TL;DR

This paper introduces an optimal transport-based method for detecting and interpreting distribution shifts in classification datasets, enabling class-specific analysis and sample retrieval for better understanding of dataset changes.

Contribution

It presents a novel, interpretable approach to identify and analyze distribution shifts using optimal transport, with class-specific insights and sample retrieval capabilities.

Findings

01

Effective identification of distribution shifts in synthetic and real datasets

02

Ability to determine which classes are affected by shifts

03

Retrieval of sample pairs for interpretability

Abstract

We propose a method to identify and characterize distribution shifts in classification datasets based on optimal transport. It allows the user to identify the extent to which each class is affected by the shift, and retrieves corresponding pairs of samples to provide insights on its nature. We illustrate its use on synthetic and natural shift examples. While the results we present are preliminary, we hope that this inspires future work on interpretable methods for analyzing distribution shifts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fault Detection and Control Systems