Comparing BERT against traditional machine learning text classification

Santiago Gonz\'alez-Carvajal; Eduardo C. Garrido-Merch\'an

arXiv:2005.13012·cs.CL·April 26, 2023·186 cites

Comparing BERT against traditional machine learning text classification

Santiago Gonz\'alez-Carvajal, Eduardo C. Garrido-Merch\'an

PDF

Open Access

TL;DR

This paper empirically compares BERT with traditional machine learning methods like TF-IDF and classical NLP approaches, demonstrating BERT's superior performance and robustness across various NLP classification tasks.

Contribution

It provides empirical evidence supporting BERT as a default choice for NLP classification tasks over traditional methods.

Findings

01

BERT outperforms traditional TF-IDF-based classifiers.

02

BERT's performance is consistent across different languages.

03

Empirical support for adopting BERT as a standard NLP tool.

Abstract

The BERT model has arisen as a popular state-of-the-art machine learning model in the recent years that is able to cope with multiple NLP tasks such as supervised text classification without human supervision. Its flexibility to cope with any type of corpus delivering great results has make this approach very popular not only in academia but also in the industry. Although, there are lots of different approaches that have been used throughout the years with success. In this work, we first present BERT and include a little review on classical NLP approaches. Then, we empirically test with a suite of experiments dealing different scenarios the behaviour of BERT against the traditional TF-IDF vocabulary fed to machine learning algorithms. Our purpose of this work is to add empirical evidence to support or refuse the use of BERT as a default on NLP tasks. Experiments show the superiority of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsLinear Layer · Weight Decay · Softmax · Adam · Multi-Head Attention · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Linear Warmup With Linear Decay · Dense Connections