Text Classification Under Class Distribution Shift: A Survey

Adriana Valentina Costache; Silviu Florin Gheorghe; Eduard Gabriel Poesina; Paul Irofti; Radu Tudor Ionescu

arXiv:2502.12965·cs.CL·January 16, 2026

Text Classification Under Class Distribution Shift: A Survey

Adriana Valentina Costache, Silviu Florin Gheorghe, Eduard Gabriel Poesina, Paul Irofti, Radu Tudor Ionescu

PDF

Open Access 1 Video

TL;DR

This survey reviews methods for text classification under distribution shifts, focusing on open-set, zero-shot, and continual learning approaches to handle evolving topics and classes in real-world scenarios.

Contribution

It categorizes existing techniques based on different distribution shift constraints and discusses future research directions in open-set text classification.

Findings

01

Mitigation strategies vary by problem setup

02

Continual learning can address distribution shift issues

03

The survey provides a comprehensive overview of current methods

Abstract

The basic underlying assumption of machine learning (ML) models is that the training and test data are sampled from the same distribution. However, in daily practice, this assumption is often broken, i.e. the distribution of the test data changes over time, which hinders the application of conventional ML models. One domain where the distribution shift naturally occurs is text classification, since people always find new topics to discuss. To this end, we survey research articles studying open-set text classification and related tasks. We divide the methods in this area based on the constraints that define the kind of distribution shift and the corresponding problem formulation, i.e. learning with the Universum, zero-shot learning, and open-set learning. We next discuss the predominant mitigation approaches for each problem setup. We further identify several future work directions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Text Classification Under Class Distribution Shift: A Survey· underline

Taxonomy

TopicsText and Document Classification Technologies · Spam and Phishing Detection