Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

Siva Rajesh Kasa; Karan Gupta; Sumegh Roychowdhury; Ashutosh Kumar; Yaswanth Biruduraju; Santhosh Kumar Kasa; Nikhil Priyatam Pattisapu; Arindam Bhattacharya; Shailendra Agarwal; Vijay huddar

arXiv:2506.12181·cs.LG·January 12, 2026

Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

Siva Rajesh Kasa, Karan Gupta, Sumegh Roychowdhury, Ashutosh Kumar, Yaswanth Biruduraju, Santhosh Kumar Kasa, Nikhil Priyatam Pattisapu, Arindam Bhattacharya, Shailendra Agarwal, Vijay huddar

PDF

Open Access 1 Video

TL;DR

This paper compares generative and discriminative text classifiers in the transformer era, analyzing their performance, efficiency, and robustness to guide practical model selection.

Contribution

It provides the first comprehensive evaluation of modern generative and discriminative transformer-based classifiers across multiple criteria.

Findings

01

Classical 'two regimes' phenomenon varies across architectures

02

Generative models show different sample efficiency and robustness

03

Guidelines for choosing models based on real-world constraints

Abstract

The comparison between discriminative and generative classifiers has intrigued researchers since Efron's seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present the first comprehensive evaluation of modern generative and discriminative architectures - Auto-regressive modeling, Masked Language Modeling, Discrete Diffusion, and Encoders for text classification. Our study reveals that the classical 'two regimes' phenomenon manifests distinctly across different architectures and training paradigms. Beyond accuracy, we analyze sample efficiency, calibration, noise robustness, and ordinality across diverse scenarios. Our findings offer practical guidance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Generative or Discriminative? Revisiting Text Classification in the Era of Transformers· underline

Taxonomy

TopicsAuthorship Attribution and Profiling