A Comparison of SVM against Pre-trained Language Models (PLMs) for Text   Classification Tasks

Yasmen Wahba; Nazim Madhavji; John Steinbacher

arXiv:2211.02563·cs.CL·November 7, 2022·5 cites

A Comparison of SVM against Pre-trained Language Models (PLMs) for Text Classification Tasks

Yasmen Wahba, Nazim Madhavji, John Steinbacher

PDF

Open Access

TL;DR

This study compares the effectiveness of pre-trained language models versus traditional SVM classifiers with TFIDF features for text classification, finding that SVM can often outperform PLMs in both cost and accuracy.

Contribution

The paper provides a systematic comparison showing that simple SVM classifiers can match or outperform complex PLMs in text classification tasks, especially in domain-specific contexts.

Findings

01

PLMs do not significantly outperform SVMs on tested datasets.

02

SVM with TFIDF features can be more cost-effective and sometimes more accurate.

03

Traditional methods remain competitive against modern PLMs in certain NLP tasks.

Abstract

The emergence of pre-trained language models (PLMs) has shown great success in many Natural Language Processing (NLP) tasks including text classification. Due to the minimal to no feature engineering required when using these models, PLMs are becoming the de facto choice for any NLP task. However, for domain-specific corpora (e.g., financial, legal, and industrial), fine-tuning a pre-trained model for a specific task has shown to provide a performance improvement. In this paper, we compare the performance of four different PLMs on three public domain-free datasets and a real-world dataset containing domain-specific words, against a simple SVM linear classifier with TFIDF vectorized text. The experimental results on the four datasets show that using PLMs, even fine-tuned, do not provide significant gain over the linear SVM classifier. Hence, we recommend that for text classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsSupport Vector Machine