Handling Out-of-Distribution Data: A Survey

Lakpa Tamang; Mohamed Reda Bouadjenek; Richard Dazeley; and Sunil Aryal

arXiv:2507.21160·cs.LG·July 30, 2025

Handling Out-of-Distribution Data: A Survey

Lakpa Tamang, Mohamed Reda Bouadjenek, Richard Dazeley, and Sunil Aryal

PDF

TL;DR

This survey reviews the challenges of distribution shifts in machine learning, formalizes different types of shifts, critiques existing methods, and discusses future research directions for handling out-of-distribution data.

Contribution

It provides a comprehensive formalization of distribution shifts, critiques current methods, and highlights overlooked aspects of out-of-distribution data in existing literature.

Findings

01

Formalization of covariate and concept shifts

02

Analysis of limitations in conventional methods

03

Proposed future research directions

Abstract

In the field of Machine Learning (ML) and data-driven applications, one of the significant challenge is the change in data distribution between the training and deployment stages, commonly known as distribution shift. This paper outlines different mechanisms for handling two main types of distribution shifts: (i) Covariate shift: where the value of features or covariates change between train and test data, and (ii) Concept/Semantic-shift: where model experiences shift in the concept learned during training due to emergence of novel classes in the test phase. We sum up our contributions in three folds. First, we formalize distribution shifts, recite on how the conventional method fails to handle them adequately and urge for a model that can simultaneously perform better in all types of distribution shifts. Second, we discuss why handling distribution shifts is important and provide an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.