Discovering Distribution Shifts using Latent Space Representations

Leo Betthauser; Urszula Chajewska; Maurice Diesendruck; Rohith Pesala

arXiv:2202.02339 · cs.LG · February 18, 2022 · 1 citation

TL;DR

This paper introduces a non-parametric framework using embedding space geometry to detect distribution shifts, enhancing model robustness assessment in representation learning.

Contribution

The paper proposes two tests for detecting distribution shifts based on embedding space geometry: one built on a robustness boundary and one that classifies subsamples of two datasets as in-distribution or out-of-distribution.

Findings

1. Both tests effectively detect distribution shifts in synthetic and real-world datasets.

2. The methods identify shifts that affect model performance.

3. The framework is non-parametric and interpretable.

Abstract

Rapid progress in representation learning has led to a proliferation of embedding models, and to associated challenges of model selection and practical application. It is non-trivial to assess a model's generalizability to new candidate datasets, and failure to generalize may lead to poor performance on downstream tasks. Distribution shifts are one cause of reduced generalizability, and are often difficult to detect in practice. In this paper, we use the embedding space geometry to propose a non-parametric framework for detecting distribution shifts, and specify two tests. The first test detects shifts by establishing a robustness boundary, determined by an intelligible performance criterion, for comparing reference and candidate datasets. The second test detects shifts by featurizing and classifying multiple subsamples of two datasets as in-distribution and out-of-distribution. In…
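The subsample-and-classify idea behind the second test can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the featurization or the classifier, so the per-dimension mean featurization, the nearest-centroid classifier, and every name below (`make_cloud`, `featurize`, `draw_features`, `shift_score`) are assumptions chosen for illustration, with synthetic Gaussian vectors standing in for real embeddings.

```python
import random
from statistics import fmean


def make_cloud(n, dim, mean, rng):
    """Hypothetical stand-in for embeddings: n Gaussian vectors centered at `mean`."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n)]


def featurize(vectors):
    """Assumed featurization: the per-dimension mean of a set of vectors."""
    return [fmean(column) for column in zip(*vectors)]


def draw_features(pool, n_subsamples, size, rng):
    """Featurize n_subsamples random subsamples of `pool`, each drawn without replacement."""
    return [featurize(rng.sample(pool, size)) for _ in range(n_subsamples)]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def shift_score(reference, candidate, n_subsamples=20, size=30, seed=0):
    """Held-out accuracy of a nearest-centroid classifier that separates
    reference subsamples from candidate subsamples.

    Accuracy near 0.5 suggests the datasets are hard to tell apart;
    accuracy near 1.0 suggests a shift. Illustrative only.
    """
    rng = random.Random(seed)
    ref, cand = list(reference), list(candidate)
    rng.shuffle(ref)
    rng.shuffle(cand)

    # Disjoint train/test pools, so held-out subsamples share no points
    # with the subsamples used to fit the centroids.
    ref_train, ref_test = ref[: len(ref) // 2], ref[len(ref) // 2:]
    cand_train, cand_test = cand[: len(cand) // 2], cand[len(cand) // 2:]

    # "Training": one centroid per class, i.e. the mean of the subsample features.
    ref_center = featurize(draw_features(ref_train, n_subsamples, size, rng))
    cand_center = featurize(draw_features(cand_train, n_subsamples, size, rng))

    # "Testing": assign each held-out subsample to the closer centroid.
    correct = 0
    for feat in draw_features(ref_test, n_subsamples, size, rng):
        correct += int(sq_dist(feat, ref_center) <= sq_dist(feat, cand_center))
    for feat in draw_features(cand_test, n_subsamples, size, rng):
        correct += int(sq_dist(feat, cand_center) < sq_dist(feat, ref_center))
    return correct / (2 * n_subsamples)


rng = random.Random(42)
reference = make_cloud(200, 4, 0.0, rng)  # synthetic "reference" embeddings
shifted = make_cloud(200, 4, 2.0, rng)    # synthetic "candidate" with a mean shift
print(shift_score(reference, shifted))
```

Splitting each dataset into disjoint halves before subsampling is a design choice of this sketch: without it, held-out subsamples would overlap with the ones used to fit the centroids, which could inflate the score even when no shift is present.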

Code & Models

Repositories

microsoft/distribution-shift-latent-representations
PyTorch · Official

Taxonomy

Topics: Time Series Analysis and Forecasting