Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning

Hang Zhao; Qile P. Chen; Yijing Barry Zhang; and Gang Yang

arXiv:2412.08587·cs.CL·May 13, 2025·3 cites

Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning

Hang Zhao, Qile P. Chen, Yijing Barry Zhang, and Gang Yang

PDF

Open Access

TL;DR

This paper systematically compares encoder-only models and large language models (LLMs) for text classification, demonstrating that fully fine-tuned LLMs outperform traditional models and can be effectively combined for multi-task learning.

Contribution

It provides a comprehensive benchmark of encoder and LLMs on text classification and introduces a method to combine fine-tuned LLMs for multi-task efficiency.

Findings

01

Fine-tuned Llama3-70B outperforms RoBERTa-large across tasks.

02

Multi-task fine-tuned LLMs match dual-model setups.

03

Combined LLMs reduce latency with maintained performance.

Abstract

Both encoder-only models (e.g., BERT, RoBERTa) and large language models (LLMs, e.g., Llama3) have been widely used for text classification tasks. However, there is a lack of systematic studies comparing the performance of encoder-based models and LLMs in text classification, particularly when fine-tuning is involved. This study employed a diverse range of models and methods, varying in size and architecture, and including both fine-tuned and pre-trained approaches. We first assessed the performances of these LLMs on the 20 Newsgroups (20NG) and MASSIVE datasets, comparing them to encoder-only RoBERTa models. Additionally, we explored the multi-task capabilities of both model types by combining multiple classification tasks, including intent detection and slot-filling, into a single model using data from both datasets. Our results indicate that fully fine-tuned Llama3-70B models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Softmax · Linear Layer · Linear Warmup With Linear Decay · Multi-Head Attention · Weight Decay · WordPiece · Layer Normalization · RoBERTa