Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?

Junyan Zhang; Yiming Huang; Shuliang Liu; Yubo Gao; Xuming Hu

arXiv:2505.18215·cs.CL·May 27, 2025

Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?

Junyan Zhang, Yiming Huang, Shuliang Liu, Yubo Gao, Xuming Hu

PDF

Open Access

TL;DR

This paper compares BERT-like models and LLMs in text classification, finding BERT-like models often outperform LLMs on certain tasks, and proposes a task-specific model selection strategy.

Contribution

It systematically evaluates BERT-like models versus LLMs across multiple datasets and introduces TaMAS, a task-driven model selection approach.

Findings

01

BERT-like models outperform LLMs on pattern-driven tasks.

02

LLMs excel in tasks requiring deep semantics or world knowledge.

03

TaMAS improves task-specific model selection.

Abstract

The rapid adoption of LLMs has overshadowed the potential advantages of traditional BERT-like models in text classification. This study challenges the prevailing "LLM-centric" trend by systematically comparing three category methods, i.e., BERT-like models fine-tuning, LLM internal state utilization, and zero-shot inference across six high-difficulty datasets. Our findings reveal that BERT-like models often outperform LLMs. We further categorize datasets into three types, perform PCA and probing experiments, and identify task-specific model strengths: BERT-like models excel in pattern-driven tasks, while LLMs dominate those requiring deep semantics or world knowledge. Based on this, we propose TaMAS, a fine-grained task selection strategy, advocating for a nuanced, task-driven approach over a one-size-fits-all reliance on LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Mathematics, Computing, and Information Processing

MethodsPrincipal Components Analysis