A View on Out-of-Distribution Identification from a Statistical Testing   Theory Perspective

Alberto Caron; Chris Hicks; Vasilios Mavroudis

arXiv:2405.03052·cs.LG·May 13, 2024·2 cites

A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective

Alberto Caron, Chris Hicks, Vasilios Mavroudis

PDF

Open Access 1 Repo

TL;DR

This paper analyzes out-of-distribution detection using statistical testing theory, proposing a framework based on Wasserstein distance with convergence guarantees and empirical evaluation.

Contribution

It reformulates OOD detection as a statistical testing problem, establishing conditions for identifiability and analyzing convergence guarantees of Wasserstein-based tests.

Findings

01

Wasserstein distance-based test shows promising convergence properties.

02

The proposed framework clarifies conditions for reliable OOD detection.

03

Empirical evaluation supports theoretical insights.

Abstract

We study the problem of efficiently detecting Out-of-Distribution (OOD) samples at test time in supervised and unsupervised learning contexts. While ML models are typically trained under the assumption that training and test data stem from the same distribution, this is often not the case in realistic settings, thus reliably detecting distribution shifts is crucial at deployment. We re-formulate the OOD problem under the lenses of statistical testing and then discuss conditions that render the OOD problem identifiable in statistical terms. Building on this framework, we study convergence guarantees of an OOD test based on the Wasserstein distance, and provide a simple empirical evaluation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

albicaron/ood_test
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Process Monitoring